Abstract: In this paper, we address the problem of estimating a multidimensional density $f$ by using indirect observations from the statistical model $Y = X + \varepsilon$. Here, $\varepsilon$ is a measurement error independent of the random vector $X$ of interest, and having a known density with respect to the Lebesgue measure. Our aim is to obtain optimal accuracy of estimation under $L_p$-losses when the error $\varepsilon$ has a characteristic function with a polynomial decay. To achieve this goal, we first construct a kernel estimator of $f$ which is fully data driven. Then, we derive for it an oracle inequality under very mild assumptions on the characteristic function of the error $\varepsilon$. As a consequence, we get minimax adaptive upper bounds over a large scale of anisotropic Nikolskii classes and we prove that our estimator is asymptotically rate optimal when $p \in [2,+\infty]$. Furthermore, our estimation procedure adapts automatically to the possible independence structure of $f$ and this allows us to improve significantly the accuracy of estimation.

Keywords and phrases: density estimation, deconvolution, kernel estimator, oracle inequality, adaptation, independence structure, concentration inequality.


1. Introduction 


Let $X_k = (X_{k,1},\ldots,X_{k,d})$, $k \in \mathbb{N}^*$, be a sequence of $\mathbb{R}^d$-valued i.i.d. random vectors defined on a complete probability space and having an unknown density $f$ with respect to the Lebesgue measure. Assume that we have at our disposal indirect observations given by

$$Y_k = X_k + \varepsilon_k, \quad k = 1,\ldots,n, \qquad (1)$$

where the errors $\varepsilon_k$ are also i.i.d. $d$-dimensional random vectors, independent of the $X_k$'s, with a known density $q$.

The goal is to estimate the density $f$ by using the observations $Y^{(n)} = (Y_1,\ldots,Y_n)$. By an estimator we mean any $Y^{(n)}$-measurable mapping $\tilde f : (\mathbb{R}^d)^n \to L_p(\mathbb{R}^d)$. The accuracy of an estimator is measured by its $L_p$-risk

$$\mathcal{R}_p[\tilde f, f] := \left(\mathbb{E}_f \|\tilde f - f\|_p^p\right)^{1/p}, \quad p \in [1,+\infty), \qquad \mathcal{R}_\infty[\tilde f, f] := \mathbb{E}_f \|\tilde f - f\|_\infty.$$

Here and in the sequel, $\mathbb{E}_f$ denotes the expectation with respect to the probability measure $\mathbb{P}_f$ of the observations $Y^{(n)} = (Y_1,\ldots,Y_n)$ and $\|g\|_r$ is the $L_r$-norm of $g \in L_r(\mathbb{R}^s)$, $s \in \mathbb{N}^*$, $r \in [1,+\infty]$. We will also denote by $\check g$ the Fourier transform of $g \in L_1(\mathbb{R}^s)$, defined by $\check g(t) = \int e^{i\langle t,x\rangle} g(x)\,dx$, where $\langle \cdot,\cdot \rangle$ is the Euclidean scalar product on $\mathbb{R}^s$.


The aforementioned deconvolution model, which is more realistic than the density model (with direct observations), arises in many different fields and is the subject of many theoretical studies. In most of them, the main interest is to provide estimators which achieve optimal rates of convergence on particular functional classes in a minimax sense. For instance, the problem of minimax estimation in the deconvolution model with pointwise and $L_2$ risks was investigated by Carroll and Hall [7], Stefanski [36], Fan ([11], [12]), Pensky and Vidakovic [32], Butucea [3], Hall and Meister [18], Meister [30], Butucea and Tsybakov ([5], [6]), Butucea and Comte [4]. Global density deconvolution was also considered under a weighted $L_p$-norm (defined with an integrable weight function) by Fan [12] and under the sup-norm loss by Stefanski [36], Bissantz, Dümbgen, Holzmann and Munk [2] and Lounici and Nickl [26]. Whereas all the works cited above are in the unidimensional setting, the problem of deconvolving a multidimensional density under pointwise or $L_2$ loss has been addressed by Masry ([27], [28]), Youndje and Wells [38] and Comte and Lacour [8].

In the present paper the aim is twofold. First, we deal with optimal deconvolution of a multivariate density under $L_p$ and sup-norm losses. Next, as in Lepski [23] (under sup-norm loss) and in Rebelles [34] (under $L_p$-losses) for the density model, we also take advantage of the fact that some coordinates of the $X_k$'s may be independent of the others, but in a unified way.


Minimax estimation. In the framework of minimax estimation it is assumed that $f$ belongs to a certain set of functions $\Sigma$, and then the accuracy of an estimator $\tilde f$ is measured by its maximal risk over $\Sigma$:

$$\mathcal{R}_p[\tilde f, \Sigma] := \sup_{f \in \Sigma} \mathcal{R}_p[\tilde f, f].$$

The objective here is to construct an estimator $\tilde f_*$ which achieves the asymptotic of the minimax risk (minimax rate of convergence):

$$\mathcal{R}_p[\tilde f_*, \Sigma] \asymp \inf_{\tilde f} \mathcal{R}_p[\tilde f, \Sigma] =: \varphi_{n,p}(\Sigma), \quad n \to +\infty,$$

where the infimum is taken over all possible estimators. Such an estimator is called minimax on $\Sigma$.

In this paper, we focus on the problem of minimax estimation over anisotropic Nikolskii classes of densities $N_{r,d}(\beta,L)$; see the definition in Section 2.3. Whereas the vector $\beta = (\beta_1,\ldots,\beta_d)$ represents the smoothness of the target density, $r = (r_1,\ldots,r_d)$ represents the index of homogeneity. For the case where $p$ is finite we will assume that the smoothness of $f$ is measured in the same $L_p$-norm as the accuracy of estimation, which means $r_j = p$ for $j = 1,\ldots,d$. In the latter case, the vector $r$ will be replaced by $p$ in the notation of the functional class. If $\beta_j = \beta_0$, $L_j = L_0$ and $r_j = r_0$ for all $j = 1,\ldots,d$, any function belonging to $N_{r_0,d}(\beta_0,L_0)$ is called an isotropic function.

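In Comte and Lacour [8] it was shown that

$$\varphi_{n,2}(N_{2,d}(\beta,L)) \asymp n^{-\frac{\tau}{2\tau+1}}, \qquad \tau := \left[\sum_{j=1}^{d} \frac{2\lambda_j+1}{\beta_j}\right]^{-1}, \qquad (2)$$

when the common density $q$ of the errors (which is assumed to be known) satisfies

$$A_1 \prod_{j=1}^{d} (1+t_j^2)^{-\lambda_j/2} \leq |\check q(t)| \leq A_2 \prod_{j=1}^{d} (1+t_j^2)^{-\lambda_j/2}, \quad \forall t \in \mathbb{R}^d,$$

for some constants $A_1, A_2, \lambda_j > 0$, $j = 1,\ldots,d$. Such a density is usually called ordinary smooth of order $\lambda = (\lambda_1,\ldots,\lambda_d)$.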
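To make the ordinary smooth condition concrete, the following sketch checks it numerically in dimension one for centered Laplace noise, whose characteristic function is $1/(1+b^2t^2)$ and therefore decays polynomially with $\lambda = 2$. The scale `b`, the grid and the function names are illustrative assumptions and are not part of the paper.

```python
import numpy as np

def laplace_cf(t, b=1.0):
    """Characteristic function of a centered Laplace(0, b) law: 1 / (1 + b^2 t^2)."""
    return 1.0 / (1.0 + (b * t) ** 2)

def ordinary_smooth_ratio(t, b=1.0, lam=2.0):
    """|cf(t)| divided by the reference decay (1 + t^2)^(-lam / 2)."""
    return np.abs(laplace_cf(t, b)) * (1.0 + t ** 2) ** (lam / 2.0)

# Hypothetical scale parameter and frequency grid, chosen only for illustration.
b = 0.5
t = np.linspace(-200.0, 200.0, 40001)
ratio = ordinary_smooth_ratio(t, b)

# The ratio (1 + t^2) / (1 + b^2 t^2) lies between 1 and 1 / b^2,
# so these two constants can play the role of A_1 and A_2.
A1, A2 = min(1.0, b ** -2), max(1.0, b ** -2)
print(ratio.min(), ratio.max(), A1, A2)
```

A Gaussian characteristic function would fail this check for every finite `lam`, which is the super smooth case excluded here.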

Note that the latter result was proved in the one-dimensional setting by Fan [12]. However, whereas Fan [12] provided an estimator whose construction depends on the smoothness parameter $\beta$ of the functional class $N_{2,1}(\beta,L)$ (which is not known in practice), Comte and Lacour [8] proposed an adaptive strategy. Indeed, they constructed a single estimator which is fully data driven and minimax on each class $N_{2,d}(\beta,L)$, whatever the nuisance parameter $(\beta,L)$ in a large range. Such an estimator is called optimal adaptive over the scale $\{N_{2,d}(\beta,L)\}_{(\beta,L)}$.

Lounici and Nickl [26] considered the problem of adaptive deconvolution of a univariate density under sup-norm loss and proved that

$$\varphi_{n,\infty}(N_{\infty,1}(\beta,L)) \asymp \left(\frac{n}{\ln(n)}\right)^{-\frac{\tau}{2\tau+1}}, \qquad \tau := \frac{\beta}{2\lambda+1}, \qquad (3)$$

when the common density $q$ of the errors is ordinary smooth of order $\lambda > 0$. Moreover, they provided an optimal adaptive estimator over the scale of Hölder classes $\{N_{\infty,1}(\beta,L)\}_{(\beta,L)}$.

It is worth mentioning that Fan [12], Lounici and Nickl [26] and Comte and Lacour [8], as in most of the aforementioned papers, also considered the case of errors having a common density whose Fourier transform has exponential decay, usually called super smooth. In the multidimensional setting, Comte and Lacour [8] showed that, in the presence of super smooth noise, the rates of convergence on anisotropic Nikolskii classes (considered as classes of ordinary smooth densities) are logarithmic and achieved by a kernel estimator whose bandwidth depends only on the smoothness parameters of the noise. Thus, in the latter case, no bandwidth selection procedure is required to get adaptive properties. Note that Youndje and Wells [38] considered the problem of adaptive deconvolution of an isotropic density in the ordinary smooth case, namely the "moderately ill-posed" case in inverse problems. The results obtained in Comte and Lacour [8] under $L_2$-loss considerably generalize those of Youndje and Wells [38].

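In the present paper, we deal with the problem of minimax adaptive deconvolution of an anisotropic density in the ordinary smooth case with $L_p$-risks, $p \in [1,\infty]$. The rates of convergence given in (2)-(3) are recovered from the results we obtain. Indeed, we provide adaptive kernel estimators which achieve the following minimax rates of convergence respectively:

$$\varphi_{n,p}(N_{p,d}(\beta,L)) \asymp n^{-\frac{\tau}{2\tau+1}}, \quad \forall p \in [2,+\infty); \qquad (4)$$

$$\varphi_{n,\infty}(N_{r,d}(\beta,L)) \asymp \left(\frac{\ln(n)}{n}\right)^{\frac{\Upsilon}{2\Upsilon+1}}, \qquad \Upsilon := \left(\tau^{-1} + [\omega\kappa]^{-1}\right)^{-1}; \qquad (5)$$

where $\tau$ is given in (2),

$$\omega := \left[\sum_{j=1}^{d} \frac{2\lambda_j+1}{\beta_j r_j}\right]^{-1} \quad \text{and} \quad \kappa := \frac{1 - \sum_{j=1}^{d} \frac{1}{\beta_j r_j}}{\sum_{j=1}^{d} \frac{1}{\beta_j}} > 0.$$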


Here, the optimality is a direct consequence of minimax lower bounds recently obtained by Lepski and Willer [25]. As usual, these lower bounds hold under additional assumptions on the common density of the errors; see Section 2.4. Moreover, they proved that there is no uniformly consistent estimator on $N_{r,d}(\beta,L)$ under sup-norm loss if $\kappa \leq 0$. Note also that, for $p \in (1,2)$, our estimator does not achieve the minimax lower bound on $N_{p,d}(\beta,L)$ they have found. Finally, we will not consider the case $p = 1$ since the results in Lepski and Willer [25] show that there is no uniformly consistent estimator on $N_{1,d}(\beta,L)$ under $L_1$-loss.


It is important to emphasize that minimax rates depend heavily on the dimension $d$. To reduce the influence of the dimension on the accuracy of estimation (curse of dimensionality), many researchers have studied the possibility of taking into account not only the smoothness properties of the target function, but also some structural hypothesis on the statistical model. For instance, see the works on the composite function structure in Horowitz and Mammen [19], Juditsky et al. [20] and Baraud and Birgé [1], the works on multi-index structure in Goldenshluger and Lepski [13] and Lepski and Serdyukova [24], and the works on the multiple index model in density estimation in Samarov and Tsybakov [35].

Below, we discuss one way of addressing this problem in the framework of density estimation. The approach recently proposed in Lepski [23] is to take into account the independence structure of the target density $f$, namely its product structure due to the independence structure of the vector $X_1$.

Organization of the paper. In Section 2, we describe assumptions on the densities involved in the statistical model (1) and we recall the minimax lower bounds obtained in Lepski and Willer [25] that are useful in this paper. In Section 3, we introduce the family of kernel estimators we use for our procedure and then we describe the selection rule that leads to the construction of our final estimator. In Section 4, we provide some oracle inequalities and, as consequences, minimax adaptive upper bounds under $L_p$-losses over scales of anisotropic Nikolskii classes. Further, we discuss the optimality of our estimator and the influence of the independence structure of the target density on the accuracy of estimation. Proofs of all main results are given in Section 5. Proofs of useful technical results are deferred to the Appendix.

2. Assumptions on densities f and q 

2.1. Structural assumption on the target density 

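Denote by $\mathcal{I}_d$ the set of all subsets of $\{1,\ldots,d\}$, except the empty set. Let $\mathfrak{P}$ be a given set of partitions of $\{1,\ldots,d\}$. For all $I \in \mathcal{I}_d$ denote also $\bar I = \{1,\ldots,d\} \setminus I$ and $|I| = \mathrm{card}(I)$. We will use $\bar\emptyset$ for $\{1,\ldots,d\}$. Finally, for all $x \in \mathbb{R}^d$ and $I \in \mathcal{I}_d$ put $x_I := (x_j)_{j \in I}$ and, for any probability density $g : \mathbb{R}^d \to \mathbb{R}_+$,

$$g_I(x_I) := \int_{\mathbb{R}^{|\bar I|}} g(x)\,dx_{\bar I}.$$

Assume that $g_{\bar\emptyset} = g$ and that $g_{\emptyset} \equiv 1$. Note also that $f_I$ and $q_I$ are the marginal densities of $X_{1,I}$ and $\varepsilon_{1,I}$ respectively.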

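If $\mathcal{P} \in \mathfrak{P}$ is such that the vectors $X_{1,I}$, $I \in \mathcal{P}$, are independent then $f(x) = \prod_{I \in \mathcal{P}} f_I(x_I)$, $\forall x \in \mathbb{R}^d$. In the sequel, the possible independence structure of the density $f$ will be represented by a partition belonging to the following set:

$$\mathfrak{P}(f) := \left\{ \mathcal{P} \in \mathfrak{P} : \ f(x) = \prod_{I \in \mathcal{P}} f_I(x_I), \ \forall x \in \mathbb{R}^d \right\}. \qquad (6)$$

Remark that $\mathfrak{P}(f)$ is not empty if we consider that $\bar\emptyset \in \mathfrak{P}$, or that $\mathfrak{P} = \{\mathcal{P}\}$ if the independence structure of $f$ is known. The possibility of choosing $\mathfrak{P}$, instead of considering all partitions of $\{1,\ldots,d\}$, is introduced for technical purposes. This is explained in more detail in Lepski [23], Section 2.1, paragraph "Extra parameters".

Finally, we endow the set $\mathfrak{P}$ with the operation "$\circ$" introduced in Lepski [23]: for any $\mathcal{P}, \mathcal{P}' \in \mathfrak{P}$,

$$\mathcal{P} \circ \mathcal{P}' := \left\{ I \cap I' \neq \emptyset, \ I \in \mathcal{P}, \ I' \in \mathcal{P}' \right\}. \qquad (7)$$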

The use of this operation for the estimation procedure allows us to construct an estimator which 
adapts automatically to the independence structure of the underlying density. 
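
As a small illustration of the operation in (7), the sketch below represents a partition of $\{1,\ldots,d\}$ as a set of `frozenset` blocks and forms all non-empty pairwise intersections. The helper names `partition` and `compose` are hypothetical and chosen only for this example.

```python
from itertools import product

def partition(*blocks):
    """Build a partition as a set of frozensets (hypothetical helper)."""
    return {frozenset(b) for b in blocks}

def compose(P, Q):
    """Return P o Q: all non-empty intersections of a block of P with a block of Q."""
    return {I & J for I, J in product(P, Q) if I & J}

d = 3
trivial = partition(range(1, d + 1))   # the single block {1, ..., d}: no independence structure
P = partition({1, 2}, {3})
Q = partition({1}, {2, 3})

PQ = compose(P, Q)
print(sorted(sorted(block) for block in PQ))
```

Composing with the trivial partition leaves a partition unchanged, and composing two different partitions produces their common refinement, which is why the auxiliary estimators built on $\mathcal{P} \circ \mathcal{P}'$ can be compared across candidate structures.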

2.2. Noise assumptions for upper bounds 

Both the definition of our estimation procedure and the computation of the $L_p$-risk, $p \in (1,+\infty]$, lead us to consider that the density $q$ of the noise random vector $\varepsilon_1$ satisfies the following assumptions.

Assumption (N1). Assume that, for any $I \in \mathcal{P} \circ \mathcal{P}'$, $(\mathcal{P},\mathcal{P}') \in \mathfrak{P} \times \mathfrak{P}$:
(i) if p = 2 , then 11^1];^ < +oo; 

(ii) if $p \in (2,+\infty]$, then $\|q_I\|_\infty < +\infty$.
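
Assumption (N2). Assume that, for some constants $A > 0$, $\lambda_j > 0$, $j = 1,\ldots,d$, one has for any $I \in \mathcal{P} \circ \mathcal{P}'$, $(\mathcal{P},\mathcal{P}') \in \mathfrak{P} \times \mathfrak{P}$:

(i) if $p = 2$,

$$|\check q_I(t_I)| \geq A^{-1} \prod_{j \in I} (1+t_j^2)^{-\lambda_j/2}, \quad \forall t \in \mathbb{R}^d;$$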

{ii) if p G (l,+oo)\{2}, qi{ti) / 0, Vt G ^ G and 

[D"^Qi ■'] (i/)nc < Aj] (1 + ii)^ , Vt G M'', Vo/ = G N'^', < |/|; 

j&I j£l jGl 

(in) if p = + 00 , qi{ti) 7 ^ 0, Vt G qi G (rI^I) and 

I [£»“'= g/-!] iti)\ <All{l+t])^, VtGR'", VfcG/,VafcG{0,l}. 
j&l 

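Here and in the sequel, $D_k^{\alpha_k} g$ denotes the $\alpha_k$th order partial derivative of $g$ with respect to the $k$th variable, $D_k^0 g = g$ and, for any multi-index $\alpha = (\alpha_1,\ldots,\alpha_s) \in \mathbb{N}^s$, $D^\alpha g$ denotes the derivative $D_1^{\alpha_1} \cdots D_s^{\alpha_s} g$ of $g : \mathbb{R}^s \to \mathbb{R}$.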

Assumption (N1) is satisfied for many distributions such as the centered Gaussian, Cauchy, Laplace or Gamma-type multivariate ones. Assumption (N2) is quite restrictive since it does not hold for the classical Cauchy and Gaussian densities, whose characteristic functions have exponential decay. However, it is satisfied by the centered Laplace and Gamma-type distributions, whose characteristic functions have polynomial decay. As mentioned in Comte and Lacour [8], the latter case is of great interest in particular physical contexts; see, for instance, the study of the pile-up model in Comte and Rebafka [9].

In what follows, we assume that $q$ satisfies Assumptions (N1)-(N2).


2.3. Smoothness assumption on the target density 


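In the literature there are several equivalent definitions of the anisotropic Nikolskii class of densities. Let us recall the definition we use in the present paper. Let $\{e_1,\ldots,e_s\}$ be the canonical basis in $\mathbb{R}^s$, $s \in \mathbb{N}^*$.

Definition 1. Assume that $r = (r_1,\ldots,r_s) \in [1,+\infty]^s$, $\beta = (\beta_1,\ldots,\beta_s) \in (0,+\infty)^s$ and $L = (L_1,\ldots,L_s) \in (0,+\infty)^s$. A probability density $g : \mathbb{R}^s \to \mathbb{R}_+$ belongs to the anisotropic Nikolskii class $N_{r,s}(\beta,L)$ if

(i) $\left\| D_j^k g \right\|_{r_j} \leq L_j$, $\forall k = 0,\ldots,\lfloor \beta_j \rfloor$, $\forall j = 1,\ldots,s$;

(ii) $\left\| D_j^{\lfloor \beta_j \rfloor} g(\cdot + z e_j) - D_j^{\lfloor \beta_j \rfloor} g(\cdot) \right\|_{r_j} \leq L_j |z|^{\beta_j - \lfloor \beta_j \rfloor}$, $\forall z \in \mathbb{R}$, $\forall j = 1,\ldots,s$.

Here and in the sequel, $\lfloor a \rfloor$ is the largest integer strictly less than the real number $a$. Furthermore, we use the notation $N_{r,s}(\beta,L)$ for $N_{\mathbf{r},s}(\beta,L)$ when $\mathbf{r} = (r,\ldots,r)$.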

In order to take into account the smoothness of the underlying density and its possible independence structure simultaneously, a certain collection of anisotropic Nikolskii classes of densities was introduced in Lepski [23], Section 3, Definition 2. However, since the adaptation is not necessarily considered with respect to the set of all partitions of $\{1,\ldots,d\}$, the condition imposed therein can be weakened. For instance, if $\mathfrak{P} = \{\bar\emptyset\}$ (no independence structure), we want to recover the well known results concerning adaptive estimation over the scale of anisotropic Nikolskii classes of densities $\{N_{r,d}(\beta,L)\}$, which is not possible with the classes introduced in Lepski [23]. For these reasons, the following collection $\{N_{r,d}(\beta,L,\mathcal{P})\}_{\mathcal{P}}$ was introduced in Rebelles [33], Section 3.1.

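Definition 2. Let $r \in [1,+\infty]^d$ and $(\beta,L,\mathcal{P}) \in (0,+\infty)^d \times (0,+\infty)^d \times \mathfrak{P}$ be fixed. A probability density $g : \mathbb{R}^d \to \mathbb{R}_+$ belongs to the class $N_{r,d}(\beta,L,\mathcal{P})$ if

$$g(x) = \prod_{I \in \mathcal{P}} g_I(x_I), \ \forall x \in \mathbb{R}^d; \qquad g_I \in N_{r_I,|I|}(\beta_I,L_I), \ \forall I \in \mathcal{P}' \circ \mathcal{P}'', \ \forall (\mathcal{P}',\mathcal{P}'') \in \mathfrak{P} \times \mathfrak{P}. \qquad (8)$$

Note that, if $\mathfrak{P} = \{\bar\emptyset\}$, the class $N_{r,d}(\beta,L,\bar\emptyset)$ coincides with the classical anisotropic Nikolskii class of densities $N_{r,d}(\beta,L)$.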


2.4. Noise assumptions for minimax lower bounds


Recently, Lepski and Willer [25] have obtained minimax lower bounds for $\varphi_{n,p}(N_{r,d}(\beta,L))$, $p \in [1,+\infty]$, when the density $q$ of the noise random vector $\varepsilon_1$ satisfies the following assumption.

Assumption (N3). For any multi-index $\alpha = (\alpha_1,\ldots,\alpha_d) \in \{0,1\}^d$ satisfying $\alpha_1 + \ldots + \alpha_d \geq 1$, $D^\alpha \check q$ exists. Furthermore, there exist constants $B > 0$ and $\lambda_j > 0$, $j = 1,\ldots,d$, such that:

(i) $|\check q(t)| \leq B \prod_{j=1}^{d} (1+t_j^2)^{-\lambda_j/2}$, $\forall t \in \mathbb{R}^d$;

{a) < B, ya = {ai,...,ad) e{0,iy, ai + ... +ad > 1 . 

Note first that Assumption (N3) is also satisfied by centered Laplace or Gamma-type distributions. Next, if $\mathfrak{P} = \{\bar\emptyset\}$ (no independence structure), any density $q$ that satisfies both condition (i) of Assumption (N3) and Assumption (N2) verifies

$$A^{-1} \prod_{j=1}^{d} (1+t_j^2)^{-\lambda_j/2} \leq |\check q(t)| \leq B \prod_{j=1}^{d} (1+t_j^2)^{-\lambda_j/2}, \quad \forall t \in \mathbb{R}^d,$$

and hence is ordinary smooth of order $\lambda = (\lambda_1,\ldots,\lambda_d)$. Furthermore, the condition imposed in the left hand side of the latter inequalities, together with condition (ii) of Assumption (N3) (or Condition 1 in Lounici and Nickl [26] for the one-dimensional setting), implies that condition (iii) of Assumption (N2) is satisfied.

The results below follow from Theorems 2 and 3 in Lepski and Willer [25] and allow us to assert the optimality of our estimators when $\mathfrak{P} = \{\bar\emptyset\}$ (no independence structure).

Theorem 1. Let $L_0 > 0$ and $p \in [2,+\infty)$ be fixed. Suppose that Assumption (N3) is satisfied. Then, for any $(\beta,L) \in (0,\infty)^d \times [L_0,\infty)^d$,

$$\liminf_{n \to +\infty} \ \inf_{\tilde f_n} \left\{ \varphi_{n,p}^{-1}(N_{p,d}(\beta,L)) \, \mathcal{R}_p\left[\tilde f_n, N_{p,d}(\beta,L)\right] \right\} > 0,$$

where the infimum is taken over all possible estimators and $\varphi_{n,p}(N_{p,d}(\beta,L))$ is given in (4).

Theorem 2. Let $L_0 > 0$ and $(\beta,L,r) \in (0,\infty)^d \times [L_0,\infty)^d \times [1,+\infty]^d$ be fixed. Suppose that Assumption (N3) is satisfied. Then,

(i) there is no uniformly consistent estimator if $1 - \sum_{j=1}^{d} \frac{1}{\beta_j r_j} \leq 0$;

(ii) if $1 - \sum_{j=1}^{d} \frac{1}{\beta_j r_j} > 0$,

$$\liminf_{n \to +\infty} \ \inf_{\tilde f_n} \left\{ \varphi_{n,\infty}^{-1}(N_{r,d}(\beta,L)) \, \mathcal{R}_\infty\left[\tilde f_n, N_{r,d}(\beta,L)\right] \right\} > 0,$$

where the infimum is taken over all possible estimators and $\varphi_{n,\infty}(N_{r,d}(\beta,L))$ is given in (5).

3. Estimation procedure 

In this section, we construct an estimator following the selection scheme introduced in Lepski [23] to take into account the possible independence structure of the underlying density. If $\mathfrak{P} = \{\bar\emptyset\}$ this scheme coincides with a version of the methodology proposed by Goldenshluger and Lepski [15]. This methodology, employed in many areas of nonparametric statistics, has recently been used by Comte and Lacour [8] in the framework of the deconvolution model.

3.1. Kernel-type estimators 

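Let $K : \mathbb{R} \to \mathbb{R}$ be a fixed symmetric kernel ($\int K = 1$) belonging to the well known Schwartz class $\mathbb{S}(\mathbb{R})$. For instance, $K$ may be a Gaussian kernel. For all $I \in \mathcal{I}_d$, $h \in (0,1]^d$ and $x \in \mathbb{R}^d$ put

$$K_I(x_I) := \prod_{j \in I} K(x_j), \qquad K_{h_I}(x_I) := V_{h_I}^{-1} \prod_{j \in I} K(x_j/h_j), \qquad V_{h_I} := \prod_{j \in I} h_j.$$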

Therefore, in view of the definition of both the kernel K and Assumption (N2) on the errors, 
one can define the kernel-type estimator 

The ideas that led to the introduction of the estimators $\hat f_{h_I}$ are explained in Fan [12] in the one-dimensional setting and in Comte and Lacour [8] in the multivariate context.
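
For intuition, here is a minimal one-dimensional sketch of the classical deconvolution kernel estimator discussed in the works cited above, not a reproduction of the paper's own definition. It assumes a Gaussian kernel $K$ and centered Laplace noise with scale `b`, for which $1/\check q(t) = 1 + b^2 t^2$ and Fourier inversion gives the closed form $L_h(u) = K(u) - (b/h)^2 K''(u)$. All names, the sample size and the parameter values are illustrative assumptions.

```python
import numpy as np

def deconv_kernel_laplace(u, h, b):
    """Deconvolution kernel for a Gaussian K and Laplace(0, b) noise.

    With K''(u) = (u^2 - 1) K(u), the closed form is
    L_h(u) = K(u) * (1 + (b / h)^2 * (1 - u^2)).
    """
    K = np.exp(-0.5 * u ** 2) / np.sqrt(2.0 * np.pi)
    return K * (1.0 + (b / h) ** 2 * (1.0 - u ** 2))

def deconv_density(grid, y, h, b):
    """Estimate the density of X from noisy observations y = x + eps on a grid."""
    u = (grid[:, None] - y[None, :]) / h
    return deconv_kernel_laplace(u, h, b).sum(axis=1) / (len(y) * h)

# Simulated data from the model Y = X + eps (illustrative choices).
rng = np.random.default_rng(0)
n, b, h = 2000, 0.3, 0.4
x = rng.normal(size=n)                      # unobserved sample with N(0, 1) density
y = x + rng.laplace(scale=b, size=n)        # observed, contaminated sample

grid = np.linspace(-8.0, 8.0, 1601)
f_hat = deconv_density(grid, y, h, b)
mass = f_hat.sum() * (grid[1] - grid[0])    # the deconvolution kernel integrates to one
peak = f_hat[np.argmin(np.abs(grid))]       # estimate at the origin
print(mass, peak)
```

The factor $(b/h)^2$ shows how the noise inflates the kernel as $h$ decreases, which is the source of the extra $h_j^{-\lambda_j}$ terms in the stochastic error of deconvolution estimators.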

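Family of estimators. Below we propose a data driven selection from the family of estimators

$$\mathfrak{F}[\mathfrak{P}] := \left\{ \hat f_{(h,\mathcal{P})}(x) = \prod_{I \in \mathcal{P}} \hat f_{h_I}(x_I), \ x \in \mathbb{R}^d, \ (h,\mathcal{P}) \in \mathfrak{H}_p[\mathfrak{P}] \right\},$$

where the set $\mathfrak{H}_p[\mathfrak{P}]$ of parameters $(h,\mathcal{P})$ is constructed as follows.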

For I £ Id, consider first the set of multibandwidths 


( 10 ) 


^p,l := <hi £ 


h^'^- h^P^ 

mtn ’ '"max 


iFI 


: /,. = 2-’^\ ki G r, j G / F 


h^P'l. ■= 

min * 




n 


-1 


p £ (1,-Foo), 
p = -l-oo, 


h(p) ■= 

'^max • 

7 


[ln(n)] 1^1, pG(l,-boo), 
1, p = -|-oo. 








Then define 


np,i := { hj G Sjpj : 

j&l 


— *^pl{p<oo} T Y^ln(n)l {p=+oo} 


( 11 ) 


1 


1 


A (1 ), Cp :— 1 A < 


P 


P 


1 + Ar; 


2 V 


p 


p — 1 


— p[bp+A max\ 


) Ap 


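The constant $c_p$ is chosen in order to have $\mathfrak{H}_{p,I} \neq \emptyset$, $\forall n \geq 3$.
Put finally


$$\mathfrak{H}_p[\mathfrak{P}] := \left\{ (h,\mathcal{P}) \in (0,1]^d \times \mathfrak{P} : \ h_I \in \mathfrak{H}_{p,I}, \ \forall I \in \mathcal{P} \right\}.$$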


max Xj. 


The introduction of the estimator $\hat f_{(h,\mathcal{P})}$ is based on the following simple observation. If there exists $\mathcal{P} \in \mathfrak{P}(f)$, the idea is to estimate separately each marginal density corresponding to $I \in \mathcal{P}$. Since the estimated density possesses the product structure, we seek its estimator in the same form.


Auxiliary estimators. We mimic the procedure of Lepski [23] by introducing the following auxiliary estimators. Consider first the classical kernel auxiliary estimators

$$\hat f_{h_I,\eta_I}(x_I) := K_{\eta_I} \star \hat f_{h_I}(x_I), \quad \eta \in (0,1]^d,$$

where, here and in the sequel, "$\star$" stands for the standard convolution product on $\mathbb{R}^s$, $s \in \mathbb{N}^*$. Then put, for $h, \eta \in (0,1]^d$ and $\mathcal{P}, \mathcal{P}' \in \mathfrak{P}$,

$$\hat f_{(h,\mathcal{P}),(\eta,\mathcal{P}')}(x) := \prod_{I \in \mathcal{P} \circ \mathcal{P}'} \hat f_{h_I,\eta_I}(x_I),$$

where the operation "$\circ$" is defined by (7).

The ideas that led to the introduction of the estimators $\hat f_{(h,\mathcal{P}),(\eta,\mathcal{P}')}$, based on both the operations "$\star$" and "$\circ$", are explained in Lepski [23], Section 2.1, paragraph "Estimation construction". Note that the arguments given in the latter paper do not depend on the norm used in the definition of the risk and remain valid for estimation under $L_p$-loss.


3.2. Selection rule 

For I G Id and h G (0, l]*^, define 




--1 II T 

UP \\L 


(^/) I 


n 


— A • —- 
'^J 2 


_ 1 
n 2 


UjGih/' " + \/ln(n) \\L(^hi)\ 


2p 

P+2 


. n 2 V^ln(n) l\jel ^ 


-Xi-i 


Put also Ap := djp [ Gp]^^^ , where d := sup-pgsp |P|, 


PG (1,2), 

p = 2, 

p G (2, Too), 

p = +00. 

















and $\gamma_p > 0$ is a numerical constant whose expression is given in Section 5.1 below.
For $h \in (0,1]^d$ and $\mathcal{P} \in \mathfrak{P}$ introduce $U_p(h,\mathcal{P}) := \sup_{I \in \mathcal{P}} U_p(h_I)$ and


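$$\Delta_p(h,\mathcal{P}) := \sup_{(\eta,\mathcal{P}') \in \mathfrak{H}_p[\mathfrak{P}]} \left[ \left\| \hat f_{(h,\mathcal{P}),(\eta,\mathcal{P}')} - \hat f_{(\eta,\mathcal{P}')} \right\|_p - \Lambda_p U_p(\eta,\mathcal{P}') \right]_+. \qquad (12)$$

Define finally $(\hat h, \hat{\mathcal{P}})$ satisfying

$$\Delta_p(\hat h,\hat{\mathcal{P}}) + \Lambda_p U_p(\hat h,\hat{\mathcal{P}}) = \inf_{(h,\mathcal{P}) \in \mathfrak{H}_p[\mathfrak{P}]} \left[ \Delta_p(h,\mathcal{P}) + \Lambda_p U_p(h,\mathcal{P}) \right]. \qquad (13)$$

Our selected estimator is $\hat f := \hat f_{(\hat h,\hat{\mathcal{P}})}$.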
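
The structure of a selection rule of this type (compare each candidate with auxiliary estimators, subtract a majorant, then minimize the penalized criterion) can be sketched in a deliberately simplified setting. The code below assumes direct observations in dimension one, a Gaussian kernel, the $L_2$-norm, no independence structure, and a majorant of the form $\|K_h\|_2/\sqrt{n}$; none of these simplifications, names or constants come from the paper, whose majorant $U_p$ and constant $\Lambda_p$ are different.

```python
import numpy as np

def gauss_kde(grid, sample, h):
    """Gaussian kernel density estimate with bandwidth h, evaluated on grid."""
    u = (grid[:, None] - sample[None, :]) / h
    return np.exp(-0.5 * u ** 2).sum(axis=1) / (len(sample) * h * np.sqrt(2.0 * np.pi))

def gl_select(sample, grid, bandwidths, lam=1.0):
    """Goldenshluger-Lepski-type bandwidth selection (simplified, assumed majorant)."""
    n = len(sample)
    dx = grid[1] - grid[0]

    def l2_norm(g):
        return np.sqrt(np.sum(g ** 2) * dx)

    # Assumed majorant: ||K_h||_2 / sqrt(n), with ||K||_2^2 = 1 / (2 sqrt(pi)) for Gaussian K.
    U = {h: np.sqrt(1.0 / (2.0 * np.sqrt(np.pi) * n * h)) for h in bandwidths}
    f = {h: gauss_kde(grid, sample, h) for h in bandwidths}

    crit = {}
    for h in bandwidths:
        delta = 0.0  # positive part of the supremum over eta
        for eta in bandwidths:
            # K_eta * f_h is again a Gaussian KDE, with bandwidth sqrt(h^2 + eta^2).
            f_h_eta = gauss_kde(grid, sample, float(np.hypot(h, eta)))
            delta = max(delta, l2_norm(f_h_eta - f[eta]) - lam * U[eta])
        crit[h] = delta + lam * U[h]
    return min(crit, key=crit.get), crit

rng = np.random.default_rng(1)
sample = rng.normal(size=300)
grid = np.linspace(-6.0, 6.0, 601)
bandwidths = [2.0 ** -k for k in range(6)]   # dyadic grid, as in the multibandwidth sets
h_sel, crit = gl_select(sample, grid, bandwidths)
print(h_sel)
```

In the paper's procedure the same comparison is carried out over pairs $(h,\mathcal{P})$, with the auxiliary estimators built on $\mathcal{P} \circ \mathcal{P}'$, so that the bandwidth and the partition are selected simultaneously.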

Note first that the existence of the quantities involved in the selection procedure is ensured by both the finiteness of the set $\mathfrak{H}_p[\mathfrak{P}]$ and the following result. The first statement given in Proposition 1 is a simple consequence of the Marcinkiewicz Multiplier Theorem; see Theorem 5.2.4 and Corollary 5.2.5 in Grafakos [17].

Proposition 1. Assume that Assumptions (N1)-(N2) are satisfied. 

(i) For any p G (1, 2) and any / G P o V', {V,V') G ip x fp, there exists a constant Cpj := 
Cpj{\I\,K,q)>0 


LhAlp < Cpj n V"'’ € (O’ !]"• 

3&I 


(a) For any I G V oV, {V,V') G ^ x there exists a constant Cj := Ci{\I\,K,q) > 0 such 
that 

\\Lhfi\, < Cil[h-"^-K WLufiL < C/H ^ (O’l]"- 

j&i j&i 

The proof of this proposition is postponed to the Appendix. It is important to emphasize that the first bound was not used for the definition of $U_p(h_I)$ since the dimensional constant is not given explicitly in Theorem 5.2.4 of Grafakos [17].

Next, we also emphasize that the quantity $U_p(h_I)$ can be viewed, up to a numerical constant, as a uniform bound on the $L_p$-norm of the stochastic error provided by the kernel-type estimator $\hat f_{h_I}$. This is explained by the following result. For $I \in \mathcal{I}_d$, $h \in (0,1]^d$ and $x \in \mathbb{R}^d$ define

$$\xi_{h_I}(x_I) := \hat f_{h_I}(x_I) - \mathbb{E}_f\{\hat f_{h_I}(x_I)\}.$$


Proposition 2. Assume that Assumptions (N1)-(N2) are verified. Let I G V oV, {V,V') G 
^ X be arbitrary fixed. If p ^ (1, +oo], r > 1 and n > 3 then 


E sup 

hidHpi 



'^py{T)Up{hi) 


< Cp{v)n 


Cp(r) > 0. 


(14) 


The constants 7p,7(r) and Cp(r) do not depend on the sample size n. Their explicit expressions 
can be found in the proof of the latter result, which is also postponed to Appendix. 

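Finally, in view of the assumptions on the kernel $K$, since $\mathfrak{H}_p[\mathfrak{P}]$ is a finite set, $(\hat h,\hat{\mathcal{P}})$ exists, is in $\mathfrak{H}_p[\mathfrak{P}]$ and is measurable. It follows that $\hat f : (\mathbb{R}^d)^n \to L_p(\mathbb{R}^d)$ is a $Y^{(n)}$-measurable mapping.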

4. Main results


In this section, we first provide oracle inequalities for our estimator $\hat f$. Then, we discuss adaptive minimax estimation over scales of anisotropic Nikolskii classes.

4.1. Oracle inequalities

Note that the construction of the proposed procedure does not require any condition concerning 
the density /. However, the following mild assumption will be used for computing its risk; 


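$$f \in \mathbb{F}_p[\mathfrak{P}] := \left\{ g \in \mathbb{F} : \ \sup_{(\mathcal{P},\mathcal{P}') \in \mathfrak{P} \times \mathfrak{P}} \ \sup_{I \in \mathcal{P} \circ \mathcal{P}'} \|g_I\|_p < \infty \right\},$$

where $\mathbb{F}$ denotes the set of all probability densities $g : \mathbb{R}^d \to \mathbb{R}_+$. The considered class of densities is determined by the choice of $\mathfrak{P}$ and in particular

$$\mathbb{F}_p[\{\bar\emptyset\}] = \left\{ g \in \mathbb{F} : \|g\|_p < \infty \right\}, \qquad \mathbb{F}_p[\{\mathcal{P}\}] = \left\{ g \in \mathbb{F} : \sup_{I \in \mathcal{P}} \|g_I\|_p < \infty \right\}.$$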


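Define, for $(h,\mathcal{P}) \in \mathfrak{H}_p[\mathfrak{P}]$ such that $\mathcal{P} \in \mathfrak{P}(f)$,

$$\mathcal{R}_p[(h,\mathcal{P}),f] := \left( \mathbb{E}_f \sup_{\mathcal{P}' \in \mathfrak{P}} \ \sup_{I \in \mathcal{P} \circ \mathcal{P}'} \left\| \hat f_{h_I} - f_I \right\|_p^p \right)^{1/p}, \quad p \in (1,+\infty),$$

$$\mathcal{R}_\infty[(h,\mathcal{P}),f] := \mathbb{E}_f \sup_{\mathcal{P}' \in \mathfrak{P}} \ \sup_{I \in \mathcal{P} \circ \mathcal{P}'} \left\| \hat f_{h_I} - f_I \right\|_\infty.$$

If the possible independence structure $\mathcal{P}$ of the target density is known, the latter quantity can be viewed as an "$L_p$-risk" of the estimator $\hat f_{(h,\mathcal{P})}$, defined with the loss

$$\ell\left(\hat f_{(h,\mathcal{P})}, f\right) := \sup_{\mathcal{P}' \in \mathfrak{P}} \ \sup_{I \in \mathcal{P} \circ \mathcal{P}'} \left\| \hat f_{h_I} - f_I \right\|_p.$$

In this case, we see that the effective dimension of estimation is not $d$, but $d(\mathcal{P}) := \sup_{I \in \mathcal{P}} |I|$. Therefore, the best estimator from the family $\mathfrak{F}[\mathfrak{P}]$ (the oracle) should be $\hat f_{(h^*,\mathcal{P}^*)}$ such that

$$\mathcal{R}_p[(h^*,\mathcal{P}^*),f] = \inf_{(h,\mathcal{P}) \in \mathfrak{H}_p[\mathfrak{P}] : \ \mathcal{P} \in \mathfrak{P}(f)} \mathcal{R}_p[(h,\mathcal{P}),f].$$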

Let us provide the following oracle inequalities for our selected estimator /. 

Theorem 3. Assume that Assumptions (N1)-(N2) are satisfied. 

If n > 3 and p G (1, + 00 ] then: V/ G Fp [fp], 

7^p[7,/l <Cp,i(fp) inf {np[{h,V)J]+jpUp{h,V)} + Cp, 2 {ip)n-K (16) 

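where $\mathbf{f}_p := 1 \vee \sup_{(\mathcal{P},\mathcal{P}') \in \mathfrak{P} \times \mathfrak{P}} \sup_{I \in \mathcal{P} \circ \mathcal{P}'} \|f_I\|_p$.

The explicit expressions of $C_{p,1}(\mathbf{f}_p) = C_{p,1}(d,\mathfrak{P},K,q,\mathbf{f}_p)$ and $C_{p,2}(\mathbf{f}_p) = C_{p,2}(d,\mathfrak{P},K,q,\mathbf{f}_p)$ are given in the proof of the theorem. It is worth noting that the maps $\mathbf{f}_p \mapsto C_{p,1}(\mathbf{f}_p)$ and $\mathbf{f}_p \mapsto C_{p,2}(\mathbf{f}_p)$ are bounded on any bounded interval of $\mathbb{R}_+$.

If $\mathfrak{P} = \{\bar\emptyset\}$ we automatically obtain some oracle inequalities for estimation on $\mathbb{R}^d$ under $L_p$-loss, without considering any independence structure. In this case, the result above can be improved. Indeed, by scrutinizing its proof, one can easily see that the following theorem is true.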

Theorem 4. Assume that $\mathfrak{P} = \{\bar\emptyset\}$ and that Assumptions (N1)-(N2) are satisfied.
If $n \geq 3$ and $p \in (1,+\infty]$ then: $\forall f \in \mathbb{F}$,


IZr, 


fj 


< 


inf 

hen 


■p,0 


{( 


i + 2 ||iir|r 7^^ 


fhJ 




+ 2C, 


,n 


( 17 ) 


The explicit expression of the absolute constant $C_p = C_p(d,K,q) > 0$ is given in the proof of the theorem.

Note first that the statement of Theorem 4 holds for all probability densities $f \in \mathbb{F}$, which is not true for Theorem 3. Next, the constant $1 + 2\|K\|_1^d$ is more suitable than $C_{p,1}(\mathbf{f}_p)$. Indeed, the prime interest in the oracle approach is to obtain a constant that does not depend on the target density and is close to one. However, Theorem 3 allows us to consider both the smoothness properties and the independence structure of the target density and then to reduce the influence of the dimension on the accuracy of estimation. Indeed, if $f$ has an independence structure $\mathcal{P}$ and the smoothness parameter $h$ is fixed and properly chosen, then our procedure should choose the true partition $\mathcal{P}$ and the estimator $\hat f_{(h,\mathcal{P})}$ should provide a better accuracy of estimation than the classical kernel-type estimator $\hat f_h$. This was illustrated by a short simulation study in Rebelles [34] for the density model (with direct observations), under the $L_2$-loss.


4.2. $L_p$-adaptive minimax estimation

In what follows, we illustrate the application of Theorems 3 and 4 to adaptive estimation over anisotropic Nikolskii classes of densities $N_{r,d}(\beta,L,\mathcal{P})$ and $N_{r,d}(\beta,L)$ respectively. To compute an $L_p$-risk of a kernel-type estimator, we first compute its bias. Thus, we need to strengthen the assumptions imposed on the kernel $K$. One of the possibilities is the following, proposed in Kerkyacharian, Lepski and Picard [21].

For a given integer $\ell \geq 2$ and a given symmetric function $u : \mathbb{R} \to \mathbb{R}$ belonging to the Schwartz class $\mathbb{S}(\mathbb{R})$ and satisfying $\int_{\mathbb{R}} u(z)\,dz = 1$ set

$$u_\ell(z) := \sum_{i=1}^{\ell} \binom{\ell}{i} (-1)^{i+1} \frac{1}{i}\, u\!\left(\frac{z}{i}\right). \qquad (18)$$

Furthermore we use $K = u_\ell$ in the definition of the collection of estimators $\mathfrak{F}[\mathfrak{P}]$.

The relation of the kernel $u_\ell$ to anisotropic Nikolskii classes is discussed in Kerkyacharian, Lepski and Picard [21]. In particular, it has been shown that

$$\int_{\mathbb{R}} K(z)\,dz = 1, \qquad \int_{\mathbb{R}} z^k K(z)\,dz = 0, \quad \forall k = 1,\ldots,\ell-1. \qquad (19)$$
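
The moment conditions can be checked numerically. The sketch below assumes the usual form of the Kerkyacharian, Lepski and Picard construction, $u_\ell(z) = \sum_{i=1}^{\ell} \binom{\ell}{i} (-1)^{i+1} \frac{1}{i} u(z/i)$, with a standard Gaussian density as the base function $u$; the choice $\ell = 4$, the integration grid and the function names are illustrative assumptions.

```python
import numpy as np
from math import comb

def u_gauss(z):
    """Base function u: the standard Gaussian density (symmetric, integrates to one)."""
    return np.exp(-0.5 * z ** 2) / np.sqrt(2.0 * np.pi)

def klp_kernel(z, ell, u=u_gauss):
    """u_ell(z) = sum_{i=1}^{ell} C(ell, i) (-1)^(i+1) (1/i) u(z/i)."""
    return sum(comb(ell, i) * (-1) ** (i + 1) * u(z / i) / i for i in range(1, ell + 1))

def moment(k, ell, half_width=60.0, num=240001):
    """Riemann-sum approximation of the k-th moment of u_ell."""
    z = np.linspace(-half_width, half_width, num)
    return float(np.sum(z ** k * klp_kernel(z, ell)) * (z[1] - z[0]))

ell = 4
moments = [moment(k, ell) for k in range(ell)]
print(moments)
```

The zeroth moment equals one and the moments of order $1,\ldots,\ell-1$ vanish because $\sum_{i=0}^{\ell} (-1)^i \binom{\ell}{i} i^k = 0$ for $k < \ell$; this is what allows the bias to be controlled for smoothness up to $\ell$.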


4.2.1. Minimax adaptive estimation under an $L_p$-loss

For (/3,P) £ (0,+ 00 )*^ X fp define (j)n,p{l3,'P) := n~2^+i, where 

r := T{fi,V) = inf r/, tj : = 


^p + 1 

h 


- —i 


( 20 ) 


where bp is given in (11). 

Assume that $\bar\emptyset \in \mathfrak{P}$ and consider the estimator $\hat f$ defined by the selection rule (12)-(13) with $p \in (1,+\infty)$.

Theorem 5. Let $p \in (1,+\infty)$ be arbitrary but fixed. Suppose that Assumptions (N1)-(N2) are satisfied. Then for any $(\beta,L,\mathcal{P}) \in (0,\ell]^d \times (0,\infty)^d \times \mathfrak{P}$ one has

$$\limsup_{n \to +\infty} \left\{ \phi_{n,p}^{-1}(\beta,\mathcal{P}) \, \mathcal{R}_p\left[\hat f, N_{p,d}(\beta,L,\mathcal{P})\right] \right\} < \infty.$$

To get the statement of the latter theorem we apply Theorem 3. If $\mathfrak{P} = \{\bar\emptyset\}$ (no independence structure), we obtain the following theorem by applying Theorem 4.

Theorem 6. Let $p \in (1,+\infty)$ be arbitrary but fixed. Suppose that $\mathfrak{P} = \{\bar\emptyset\}$ and that Assumptions (N1)-(N2) are satisfied. Then for any $(\beta,L) \in (0,\ell]^d \times (0,\infty)^d$ one has

$$\limsup_{n \to +\infty} \left\{ \phi_{n,p}^{-1}(\beta,\bar\emptyset) \, \mathcal{R}_p\left[\hat f, N_{p,d}(\beta,L)\right] \right\} < \infty.$$


To the best of our knowledge, the latter results are new. In view of the assertion of Theorem 1, if $p \in [2,+\infty)$ and Assumptions (N1)-(N3) on the errors are satisfied, we deduce from Theorem 6 that $\phi_{n,p}(\beta,\bar\emptyset)$ is the minimax rate of convergence on the anisotropic Nikolskii class $N_{p,d}(\beta,L)$ and that a minimax estimator can be selected from the collection of kernel-type estimators introduced in Section 3.1. Moreover, if $\mathfrak{P} = \{\bar\emptyset\}$ (no independence structure), the quality of estimation of our estimator $\hat f$ is optimal, up to a numerical constant, on each class $N_{p,d}(\beta,L)$ whatever the nuisance parameter $(\beta,L)$. Thus, in the aforementioned case, $\hat f$ is an optimal adaptive estimator over the scale $\{N_{p,d}(\beta,L)\}_{(\beta,L)}$.

Remark that $L_p$-estimation of an anisotropic density in the deconvolution model does not require this density to be uniformly bounded, whereas this is imposed in all the works concerning the density model (with direct observations); see, e.g., Goldenshluger and Lepski [15].

Unfortunately, if $p \in (1,2)$, our estimator does not achieve the minimax lower bound on $N_{p,d}(\beta,L)$ obtained in Lepski and Willer [25] under the $L_p$-loss. We conclude that either our estimator is not minimax on $N_{p,d}(\beta,L)$ or the lower bound in Lepski and Willer [25] is not the minimax rate of convergence on the latter functional class.

It is important to emphasize that Theorems 5 and 6 together allow us to analyze the influence of the independence structure on the accuracy of estimation under an $L_p$-loss in the deconvolution model. Indeed, we see that

$$\phi_{n,p}(\beta,\bar\emptyset) \gg \phi_{n,p}(\beta,\mathcal{P}), \quad \mathcal{P} \neq \bar\emptyset,$$

whatever the independence structure of the common density of the errors. Thus, our estimation procedure allows us to improve significantly the accuracy of estimation if the target density has an independence structure $\mathcal{P} \neq \bar\emptyset$.
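
The gain can be quantified on a toy example. The sketch below assumes $p \geq 2$, so that the exponent of each block $I$ takes the form of (2) restricted to $I$, namely $\tau_I = \left[\sum_{j \in I} (2\lambda_j+1)/\beta_j\right]^{-1}$, with the rate $n^{-\tau/(2\tau+1)}$ and $\tau = \inf_{I \in \mathcal{P}} \tau_I$. The numerical values of $\beta$ and $\lambda$ and the function names are illustrative assumptions.

```python
def tau(beta, lam, block):
    """tau_I = 1 / sum_{j in I} (2 * lam_j + 1) / beta_j (assumed form for p >= 2)."""
    return 1.0 / sum((2.0 * lam[j] + 1.0) / beta[j] for j in block)

def lp_rate_exponent(beta, lam, partition):
    """Exponent tau / (2 * tau + 1) of the rate n^(-tau / (2 tau + 1)), tau = min over blocks."""
    t = min(tau(beta, lam, block) for block in partition)
    return t / (2.0 * t + 1.0)

# Three coordinates with smoothness 2 and Laplace-type noise (lambda_j = 2), 0-based indices.
beta = [2.0, 2.0, 2.0]
lam = [2.0, 2.0, 2.0]
no_structure = [[0, 1, 2]]            # trivial partition: a single block
full_independence = [[0], [1], [2]]   # all coordinates independent

e_joint = lp_rate_exponent(beta, lam, no_structure)
e_indep = lp_rate_exponent(beta, lam, full_independence)
print(e_joint, e_indep)
```

Here the exponent rises from $2/19$ to $2/9$, the one-dimensional value, which illustrates that the effective dimension is governed by the largest block of the partition rather than by $d$.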

Having said that, the question is: is $\phi_{n,p}(\beta,\mathcal{P})$ the minimax rate of convergence on the functional class $N_{p,d}(\beta,L,\mathcal{P})$? For the density model (which corresponds to $\lambda_j = 0$, $j = 1,\ldots,d$), it is asserted in Rebelles [34] that the answer is positive and that the proof of the corresponding minimax lower bound coincides with that of Theorem 3 in Goldenshluger and Lepski [16], up to minor modifications to take into account the independence structure. For the deconvolution model, we conjecture that the answer is also positive if $p \in [2,+\infty)$ and that a minimax lower bound on $N_{p,d}(\beta,L,\mathcal{P})$ can be obtained, up to straightforward modifications, as in Lepski and Willer [25].


4.2.2. Minimax adaptive estimation under sup-norm loss 

Y 

For (/3, r, V) E (0, +oo)‘^ x [1, +00]^ x qj define fin ,00 (fi, U V) := , where 

T := T(/3,r,iP) = inf T,, T, := + [cu/x,]-^)’' , 

lev \ J 
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Ti := 


H -1 


E 


2 Xj + 1 


uji : = 


-1 


^ 2Xj 


+1 


i-E 


>€i : = 


j&l Pi 


E 


( 21 ) 


j&l Pi 


Assume that $\overline{\emptyset}\in\mathfrak{P}$ and consider the estimator $\hat f$ defined by the selection rule (12)-(13) with
$p = +\infty$. As previously, we obtain the following two theorems:

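Theorem 7. Suppose that Assumptions (N1)-(N2) are satisfied. Then for any $(\beta,L,r,\mathcal{P}) \in
(0,l]^d \times (0,\infty)^d \times [1,+\infty]^d \times \mathfrak{P}$ satisfying $1-\sum_{j=1}^d \frac{1}{\beta_j r_j} > 0$ one has

\[ \limsup_{n\to+\infty} \Big\{ \varphi^{-1}_{n,\infty}(\beta,r,\mathcal{P})\, \mathcal{R}_p\big[\hat f, N_{r,d}(\beta,L,\mathcal{P})\big] \Big\} < \infty. \]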


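Theorem 8. Suppose that $\mathfrak{P} = \{\overline{\emptyset}\}$ and that Assumptions (N1)-(N2) are satisfied. Then for
any $(\beta,L,r) \in (0,l]^d \times (0,\infty)^d \times [1,+\infty]^d$ satisfying $1-\sum_{j=1}^d \frac{1}{\beta_j r_j} > 0$ one has

\[ \limsup_{n\to+\infty} \Big\{ \varphi^{-1}_{n,\infty}(\beta,r,\overline{\emptyset})\, \mathcal{R}_p\big[\hat f, N_{r,d}(\beta,L)\big] \Big\} < \infty. \]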


To the best of our knowledge, the latter results are also new. Note first that in the case of
direct observations we recover the results obtained in Lepski [23]. Next, if $\mathfrak{P} = \{\overline{\emptyset}\}$ and
$1-\sum_{j=1}^d \frac{1}{\beta_j r_j} > 0$, it follows from Theorems 2 and 8 that, in the presence of noise satisfying
Assumptions (N1)-(N3), $\varphi_{n,\infty}(\beta,r,\overline{\emptyset})$ is the minimax rate of convergence on the anisotropic
class $N_{r,d}(\beta,L)$. In this case, our estimator is an optimal adaptive one over the scale

\[ \Big\{ N_{r,d}(\beta,L),\ (\beta,L,r)\in(0,l]^d\times(0,\infty)^d\times[1,+\infty]^d,\ 1-\sum_{j=1}^d\frac{1}{\beta_j r_j} > 0 \Big\}. \]

It is worth noting that our estimator can be used for pointwise estimation. Moreover, it follows
from Theorem 8 that our estimator achieves the adaptive rates of convergence found in Comte
and Lacour [8] with a pointwise criterion over the scale of Hölder classes $\{N_{\infty,d}(\beta,L)\}$.

As previously, Theorems 7-8 allow us to conclude that our procedure leads to a better accuracy
of estimation under sup-norm loss whenever the target density has an independence structure
$\mathcal{P}\neq\overline{\emptyset}$. In this case, we improve significantly the results obtained in Comte and Lacour [8] under
a pointwise loss. Furthermore, we emphasize that if the target density has a known independence
structure, $\mathfrak{P} = \{\mathcal{P}\}$, with $1-\sum_{j=1}^d \frac{1}{\beta_j r_j} < 0$ and $1-\sum_{j\in I} \frac{1}{\beta_j r_j} > 0$, $\forall I \in \mathcal{P}$, our estimator achieves
the rate of convergence $\varphi_{n,\infty}(\beta,r,\mathcal{P})$ on $N_{r,d}(\beta,L,\mathcal{P})$ whereas there is no uniformly consistent
estimator on $N_{r,d}(\beta,L)$.

Finally, we conjecture that $\varphi_{n,\infty}(\beta,r,\mathcal{P})$ is the minimax rate of convergence on $N_{r,d}(\beta,L,\mathcal{P})$
when $1-\sum_{j=1}^d \frac{1}{\beta_j r_j} > 0$ and that a proof of the corresponding lower bound can be obtained
by a minor modification of that in Lepski and Willer [25] to take into account the possible
independence structure of the underlying density.



5. Proofs of main results 

5.1. Quantities and technical lemma 

For brevity, introduce first 

:= {/ G VoV', {V^V') G ^ X fp}. Up := sup sup sup Up{hi) < 00 , 

nCN* /GX* hi&Hpj 
Note that the finiteness of $U_p$ is due both to the definition of the sets of multibandwidths $\mathcal{H}_{p,I}$
and to the bounds given in Proposition 1.

Next, define the constant 7 ^ involved in the selection rule. For / G and r > 1, put 


7p,/(r) := < 


4+r/%^, 


m 


7(7/ + 3A(27r)“^ 


46c(p)[pVe] ^ p 2 


Kigi 


WQiWU^, 

00 2 / 


3e 


MlVC/](lV||g/||^)2 


( 1 , 2 ), 

p = 2, 

p G ( 2 ,+ 00 ), 


^ 6(7/(K,g)(lV||gr/||^)2 [93|/|ln(|/|)+69r], p =+ 00 , 


— 1 . 

where gi{ti) := (l +i|) ^ , Ci := A |(27r)"— 
definition of T-Lpj, c{p) := 15p/ ln(p) and 


Kigi 


V 


Kigi 




Cp is given in the 


Ci{K,q) := 


(27r)^ 


Kigi 


V 


Kigi 


V 


( max D^Kjgj 


V 


Kppi 


V 


1 V 

with := supjg/ \tj\gi{tj). Then, put := kpl^p^^y + A:l{p=+oo}, ^ > 1, and 

SUpp^p^gqj {7p,/(r4)} , ^ / W, 

7p,0(ri), "P = W- 


Kupi 


Ip ■= 


Finally, we need the following technical lemma in order to compute our risk bounds. Define


:= sup sup 
Ip := t)2 ||K||f \Gp 




/ r_ ^ '1 \ 0—1 


(^max|(7p, K ^fpjj , fp := 1 V 

sup WfiWp 

J 


Lemma 1. Assume that $\mathfrak{P} \neq \{\overline{\emptyset}\}$. Set $r \in \{r_1,r_2,r_4\}$. Under Assumptions (N1)-(N2), if
$p \in (1,+\infty]$ then, for all integers $n > 3$,

(EylCpD^ < Cp,i(r)n-T (E;|fp|’')" < Cp,2(r,fp), V/G Fp [ qi ] . 

The absolute constants $c_{p,1}(r) > 0$ and $c_{p,2}(r,\mathbf{f}_p) > 0$ can be explicitly expressed and the maps
$\mathbf{f}_p \mapsto c_{p,2}(r,\mathbf{f}_p)$ are bounded on any bounded interval of $\mathbb{R}_+$; see the proof of the latter result,
which is postponed to the Appendix.


5.2. Oracle inequalities: proof of Theorems 3 and 4

1) Set $p \in [1,+\infty]$ and $f \in \mathbb{F}_p[\mathfrak{P}]$. Let $(h,\mathcal{P}) \in \mathcal{H}_p[\mathfrak{P}]$, $\mathcal{P} \in \mathfrak{P}(f)$, be fixed.
In view of the triangle inequality we have


/-/ 


< 


hh,V) hh,V),(h,V) 


+ 




(h,V),(h,V) 


— f(h,V) + f{h,V) — f 


< Ap{h,V) + ApUp{h,V) + Ap{h,V) + ApUp{h,V) + f(h,p) ~ f 
Here we have used the equality $\hat f_{(h,\mathcal{P}),(\eta,\mathcal{P}')} = \hat f_{(\eta,\mathcal{P}'),(h,\mathcal{P})}$. By definition of $(\hat h,\hat{\mathcal{P}})$, we obtain


/-/ 


< 2 


Ap{h,V) + ApUp{h,V) 


+ 


f(h,V) ~ f 


( 22 ) 


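2) Suppose that $\mathcal{P} = \{I_1,\dots,I_m\}$, $m \in \{1,\dots,d\}$. Since $\mathcal{P} \in \mathfrak{P}(f)$, for any $x \in \mathbb{R}^d$

\[
\big|\hat f_{(h,\mathcal{P})}(x) - f(x)\big|
 = \Big|\prod_{I\in\mathcal{P}} \hat f_{h_I}(x_I) - \prod_{I\in\mathcal{P}} f_I(x_I)\Big|
 \le \sum_{j=1}^m \big|\hat f_{h_{I_j}}(x_{I_j}) - f_{I_j}(x_{I_j})\big|
     \prod_{k=j+1}^m \big|\hat f_{h_{I_k}}(x_{I_k})\big| \prod_{k=1}^{j-1} \big|f_{I_k}(x_{I_k})\big|.
\]

Here we have used the trivial equality: for $m \in \mathbb{N}^*$ and $a_j, b_j \in \mathbb{R}$, $j = 1,\dots,m$,

\[
\prod_{j=1}^m a_j - \prod_{j=1}^m b_j = \sum_{j=1}^m (a_j - b_j) \prod_{k=j+1}^m a_k \prod_{k=1}^{j-1} b_k, \qquad (23)
\]

where the product over the empty set is assumed to be equal to one.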
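
The telescoping identity (23) is easy to sanity-check numerically. The following is a minimal Python sketch, not part of the paper: the function names are hypothetical, and the ordering of the two partial products (the $a_k$ for $k > j$, the $b_k$ for $k < j$) is the convention assumed here.

```python
import math
import random

def telescoping_rhs(a, b):
    """Right-hand side of the identity:
    sum_j (a_j - b_j) * prod_{k>j} a_k * prod_{k<j} b_k."""
    total = 0.0
    for j in range(len(a)):
        tail = math.prod(a[j + 1:])   # empty product equals 1
        head = math.prod(b[:j])       # empty product equals 1
        total += (a[j] - b[j]) * tail * head
    return total

def product_difference(a, b):
    """Left-hand side: prod_j a_j - prod_j b_j."""
    return math.prod(a) - math.prod(b)

# Illustrative random inputs (assumed values, chosen only for the check).
random.seed(0)
a = [random.uniform(-2, 2) for _ in range(5)]
b = [random.uniform(-2, 2) for _ in range(5)]
gap = abs(product_difference(a, b) - telescoping_rhs(a, b))
print(gap < 1e-12)
```

In the proof the identity is applied with $a_j$ the kernel estimator of the $j$-th marginal and $b_j$ the marginal itself, so that each summand isolates a single estimation error.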

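In view of $\mathcal{P} \in \mathfrak{P}(f)$, the triangle inequality and the Fubini-Tonelli theorem (used for the case
$p < \infty$) we establish

\[
\big\|\hat f_{(h,\mathcal{P})} - f\big\|_p
 \le \sum_{j=1}^m \big\|\hat f_{h_{I_j}} - f_{I_j}\big\|_p
     \Big(\prod_{k=j+1}^m \big\|\hat f_{h_{I_k}}\big\|_p\Big) \Big(\prod_{l=1}^{j-1} \|f_{I_l}\|_p\Big)
 \le m \big(\max\{G_p, \mathbf{f}_p\}\big)^{m-1} \sup_{I\in\mathcal{P}} \big\|\hat f_{h_I} - f_I\big\|_p.
\]

Recall that $\|K\|_1 \ge \int K = 1$, $\mathfrak{d} = \sup_{\mathcal{P}\in\mathfrak{P}} |\mathcal{P}|$ and $G_p \ge 1$. It follows that

\[
\big\|\hat f_{(h,\mathcal{P})} - f\big\|_p \le \mathfrak{d} \big(\max\{G_p, \mathbf{f}_p\}\big)^{\mathfrak{d}-1} \sup_{I\in\mathcal{P}} \big\|\hat f_{h_I} - f_I\big\|_p. \qquad (24)
\]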


3) For any (ry, V') G Rp ^ and any x G 


f{h,V),{ri,V')ix) - f{ri,V')ix] 


n n ^Plnl'. * J'hj^p ixinr) fripixp 


\i'eV' Iev-.ir\i'^9 I'eV 

Therefore, by the same method as the one used in step 2, we establish 


f{h,V),{v,V') - fiv,V') 

< h [ Gp ] ®^® sup 

11 f^iniePini' 


p I'teV' 

leV-.ini'^^ 


(25) 

Here we have used Young's inequality and the inequalities $\|K\|_1 \ge \int K = 1$ and $G_p \ge 1$.

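4) In view of the Fubini theorem and Young's inequality, for any $I \in \mathcal{P}\circ\mathcal{P}'$, $(\mathcal{P},\mathcal{P}') \in \mathfrak{P}\times\mathfrak{P}$, and any $\eta_I \in (0,1]^{|I|}$

\[
\big\|\mathbb{E}_f\{\hat f_{\eta_I}(\cdot)\}\big\|_p = \|K_{\eta_I} \star f_I\|_p \le \|K_{\eta_I}\|_1 \|f_I\|_p \le \|K\|_1^{d}\, \mathbf{f}_p. \qquad (26)
\]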
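
As a quick numerical illustration of the Young-type bound used in (26), the sketch below checks the discrete analogue $\|k * f\|_p \le \|k\|_1 \|f\|_p$ on random sequences. The sequences, the function names and the selected values of $p$ are assumptions made for illustration only; the paper works with functions on $\mathbb{R}^{|I|}$, not with finite sequences.

```python
import random

def convolve(a, b):
    """Full discrete convolution of two finite sequences."""
    out = [0.0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] += ai * bj
    return out

def lp_norm(x, p):
    """l_p norm of a finite sequence; p = float('inf') gives the sup norm."""
    if p == float("inf"):
        return max(abs(v) for v in x)
    return sum(abs(v) ** p for v in x) ** (1.0 / p)

def young_gap(kernel, f, p):
    """||kernel||_1 * ||f||_p - ||kernel * f||_p, nonnegative by Young's inequality."""
    return lp_norm(kernel, 1) * lp_norm(f, p) - lp_norm(convolve(kernel, f), p)

random.seed(1)
kernel = [random.uniform(-1, 1) for _ in range(7)]  # signed weights, as for a higher-order kernel
f = [random.random() for _ in range(50)]            # nonnegative values standing in for a density
for p in (1, 1.5, 2, 4, float("inf")):
    # a tiny tolerance absorbs floating-point rounding
    print(p, young_gap(kernel, f, p) >= -1e-12)
```

The kernel is deliberately allowed to take negative values, which is why the bound involves $\|K\|_1$ rather than $\int K = 1$.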
Then, by the same method as the one used in step 2 and (26), for any $(\eta,\mathcal{P}') \in \mathcal{H}_p[\mathfrak{P}]$ and
any $I' \in \mathcal{P}'$ we get


i£V:ini '^9 leV-.ini'^V) 


< 


< 


, 0-1 


c) (^maxjGp, ||K||^fp|^ 
c) ||K||^ ^max |(?p, ||K||^ fp|^ 


sup 


^Vinl' * {fhinl' - finr) 


0-1 


sup 

/eP:/n/Y0 




(27) 


5) For rj E (0,1]*^ and I' E X^, since X E ^(/), we have for any x E 


E, 


{/%, (aJ/')} 


= ^Vi'(yi'~ n fini'iyini')(iyr = % |/?7jn/'(^^n/')} • 

lev-.ini'^m lev-.inr^m 


Here we have used the product structure of the kernel K and the Fubini theorem. 

Thus, in view of the triangle inequality, (25), (27) and the trivial inequality $[\sup_i x_i - \sup_i y_i]_+ \le \sup_i [x_i - y_i]_+$,
for any $(\eta,\mathcal{P}') \in \mathcal{H}_p[\mathfrak{P}]$, we get


f{h,v),{v,'P') - f{v,V') 


-kpUp{y,V') 




sup 

v&V 


/GP:/n/Y0 /GP:/n/V0 ^ 


f{h,V),{r],r') - f{v,P') ~ 


< fp sup sup 
J+ V'&f^I&VoV 


fhi - fl 


T fpCp) 


since fp > h > 1- We deduce 


Ap{h,T’) < fp sup sup 
V'e.^ I&VoV' 



fi + fp^p- 


(28) 
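
The "trivial inequality" $[\sup_i x_i - \sup_i y_i]_+ \le \sup_i [x_i - y_i]_+$ invoked in step 5 can be checked on random finite families. This is only an illustrative sketch with hypothetical function names and assumed Gaussian test data; it is not taken from the paper.

```python
import random

def pos(x):
    """Positive part [x]_+ = max(x, 0)."""
    return max(x, 0.0)

def sup_gap(xs, ys):
    """sup_i [x_i - y_i]_+ minus [sup_i x_i - sup_i y_i]_+; the claim is that this is >= 0."""
    lhs = pos(max(xs) - max(ys))
    rhs = max(pos(x - y) for x, y in zip(xs, ys))
    return rhs - lhs

random.seed(2)
worst = min(
    sup_gap([random.gauss(0, 1) for _ in range(6)],
            [random.gauss(0, 1) for _ in range(6)])
    for _ in range(10_000)
)
print(worst >= 0.0)
```

The reason is elementary: if $i^*$ attains $\sup_i x_i$, then $x_{i^*} - y_{i^*} \ge x_{i^*} - \sup_i y_i$.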


Finally, it follows from (22), (24) and (28) 



_ 

f 


/-/ 

p < 3fp < 

sup sup 
P'&^I^PoV 

fhi - fl 


p 


+ 7p^p(^) '^) + Cp 


(29) 


6) Consider the random event Bp 
Put also 




- {a„ > Cj}, Cj(f,) = (1 + 7pMp + ||K||f fp) ||K||; + 1. 


( E f sup sup 
P'GT iGPoV 




r > 1. 


In view of (26), Lemma 1, Markov’s inequality, (29), and the Cauchy-Schwarz inequality we 
get Bp C > l}, [Ff (Bp) ] ■■4 < Cp^i{r 4 )n and 

Ib^) < 39" ||K||f [Cp(fp)]®'-lf7^(^l) [ih,V),f]+-fpUp{h,V) + 


(e/ 

f-f 

(% 

f-f 


n 


^ <^Cp^i{n)cp,2{r4,ip) (jl^p^'^[ih,'P),f]+'ypUp + Cp^i{r2)^n 


[ih,V)J] < Cp,i(r2) + jpUp + ||K||f fp + fp. 

Thus, we come to the assertion of Theorem 3 with Cp^i(fp) := 39^ ||K||^ [Cp(fp)]®^“^ and 

Cp,2(fp) := 3cp,i(r4)cp,2(r4, fp) + (1 + l|K||?)fp + 2Cp4(r2)^ 


0^-1 


+3cp,i(ri)9"||K||f[Cp(fp)] 

since TZp^"^ [{h,V),f] = TZp[{h,V), f]. The constants Cp^i(rfc) and Cp^2(rfc) fp), k = 1,2,4, are 
given in the proof of Lemma 1 . 

7) Particular case: $\mathfrak{P} = \{\overline{\emptyset}\}$ (no independence structure)

Set $f \in \mathbb{F}_p[\mathfrak{P}]$ and let $h \in \mathcal{H}_{p,\overline{\emptyset}}$ be arbitrary fixed. By scrutinizing steps 1)-5) we easily see
that

"f-f <(1 + 2||K||^) l-f +27^0(ri)i^p(/i) + 2[||a||p-7p,0(ri)Z^pW 


Thus, we get from Proposition 2 




f-f 


ri \ ri 


< (1 + 2||K||“) E/ 


fh- f 


ri \ 


+ 27 p, 0 (ri)Z^p(/i) + 2cp(ri)n 


where the constants $\gamma_{p,\overline{\emptyset}}(r_1)$ and $c_p(r_1)$ are given in the proof of Proposition 2.


5.3. Adaptive minimax upper bounds: Proof of Theorems 5-8 

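1) Case $p \in (1,+\infty)$; let $(\beta,L,\mathcal{P}) \in (0,l]^d \times (0,\infty)^d \times \mathfrak{P}$ and $f \in N_{p,d}(\beta,L,\mathcal{P}) \subset \mathbb{F}_p[\mathfrak{P}]$ be
arbitrary fixed.

In view of the triangle inequality, $\forall h \in (0,1]^d$,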


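\[
\sup_{\mathcal{P}'\in\mathfrak{P}} \sup_{J\in\mathcal{P}\circ\mathcal{P}'} \big\|\hat f_{h_J} - f_J\big\|_p
 \le \sup_{\mathcal{P}'\in\mathfrak{P}} \sup_{J\in\mathcal{P}\circ\mathcal{P}'} \big\|\mathbb{E}_f\{\hat f_{h_J}(\cdot)\} - f_J\big\|_p
   + \sup_{\mathcal{P}'\in\mathfrak{P}} \sup_{J\in\mathcal{P}\circ\mathcal{P}'} \|\xi_{h_J}\|_p, \qquad (30)
\]

where $\mathbb{E}_f\{\hat f_{h_J}(x_J)\} = K_{h_J} \star f_J(x_J)$ and, recall, $\xi_{h_J}(x_J) := \hat f_{h_J}(x_J) - \mathbb{E}_f\{\hat f_{h_J}(x_J)\}$.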

Note first that, by applying Proposition 3 in Kerkyacharian, Lepski and Picard [21], it is
easily established that, for any $h \in (0,1]^d$, any $\mathcal{P}' \in \mathfrak{P}$ and any $J \in \mathcal{P}\circ\mathcal{P}'$,


\Khj* fj - /j||„ < 


jeJ jeJ j&i 


c > 0. 


(31) 
Next, if $(h,\mathcal{P}) \in \mathcal{H}_p[\mathfrak{P}]$, we easily get from Propositions 1-2


Ef sup sup 
, V'&?J&VoV' ^. 


< O sup 




Consider now, for all $I \in \mathcal{P}$, the system


'^j - H - 


n n g/ h' 




, j,kG I. 


( 32 ) 


The solution is given by 

hj=n~^^, jel, lev, (33) 

where tj is given in (20). 

Note that, for all I G V, hj G and > 1 for n large enough. 

Denote by $\tilde h_I$ the projection of $h_I$ on the dyadic grid $\mathcal{H}_{p,I}$. It is easily checked that $(\tilde h,\mathcal{P}) \in
\mathcal{H}_p[\mathfrak{P}]$ for $n$ large enough. Thus, it follows from Theorem 3, (30), (31) and (32) that


TZr, 




< C 


sup h‘ 


+ sup 
lev 




jei V 


+ ap,2n C > 0, (34) 


for n large enough. Finally, it is easily seen that we get the statement of Theorem 5 from (33) 
and (34). Similarly, the statement of Theorem 6 is obtained by applying Theorem 4. 


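2) Case $p = +\infty$; let $(\beta,L,r,\mathcal{P}) \in (0,l]^d \times (0,\infty)^d \times [1,+\infty]^d \times \mathfrak{P}$ such that $1-\sum_{j=1}^d \frac{1}{\beta_j r_j} > 0$
and $f \in N_{r,d}(\beta,L,\mathcal{P})$ be arbitrary fixed. It follows from the definition of the latter functional
class and the embedding theorem for anisotropic Nikolskii classes, see, e.g., Theorem 6.9 in
Nikolskii [31], that $N_{r,d}(\beta,L,\mathcal{P}) \subset \mathbb{F}_\infty[\mathfrak{P}]$, since $1-\sum_{j\in I} \frac{1}{\beta_j r_j} > 0$, $\forall I \in \mathcal{I}_d$.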

Note first that, in view of the arguments given in the proof of Theorem 3 in Lepski [23],
it follows from Lemma 4 in the latter paper that, for any $h \in (0,1]^d$, any $\mathcal{P}' \in \mathfrak{P}$ and any
$J \in \mathcal{P}\circ\mathcal{P}'$,


\\Khj*fj - fj\\^<csupy2hf^^^^ C := c{K,d,l,L) > 0, (35) 

Pj{I) := a{I)Piaj\l), a{I) ■=1-Y7 i^kPkV^ , := 1 “ X] {Pk^ ~ Pj^) ^k^' 

kei kei 


Next, if $(h,\mathcal{P}) \in \mathcal{H}_\infty[\mathfrak{P}]$, we easily get from Proposition 2


E/ sup sup ||4/lloo^^ 
V'e^JeVoV 



ln(n) 


n Wjei 


h 


2Aj+l 

J 


( 36 ) 


Consider now, for all $I \in \mathcal{P}$, the system


uPjD _ uPk{I) 

- klk 


i 


ln(n) 
$j, k \in I$.



















The solution is given by 


hj — 



-Ci 1 

2Tj + l /3j-(/) 




(37) 


where $\Upsilon_I$ is given in (21).

Note that, for all $I \in \mathcal{P}$, $n \prod_{j\in I} h_j^{2\lambda_j+1} \ge \ln(n)$ for $n$ large enough. Thus, as previously, we
get the statement of Theorem 7 from Theorem 3, (35), (36) and (37). Similarly, the statement
of Theorem 8 is obtained by applying Theorem 4.


6. Appendix

6.1. Proof of Proposition 1

Let h E (0,1]'^ and / E be arbitrary fixed. Note that 
1 


9,(«,):= n {1 + «j) ’ . (38) 

(27r)lb 7 r|/| gi(hiti)qi(ti) 

where $h_I t_I$ denotes the coordinate-wise product of the vectors $h_I$ and $t_I$.

1) Proof of assertion (i). Set $p \in (1,2)$. Here, we apply the Marcinkiewicz Multiplier Theorem
on $\mathbb{R}^{|I|}$ given in Grafakos [17], p. 363, with


Mti) = 9i {hiti)qi (ti). 

In view of Assumption (N2) on $q$, $m$ is a bounded function defined away from the coordinate
axes on $\mathbb{R}^{|I|}$ and is $\mathcal{C}^{|I|}$ on this region. Moreover,


sup \m{ti)\ < A sup 


n ( 1 +«i)" n (^+) 

j^i j&i 


< 


A rift 


-^9 

j 


(39) 




Set aj = (oj )jei e satisfying |a/| := — |7|- In view of Leibniz’s rule, one has 


(tj) = Y. (V ) \ ll^7 [ {hiti) {ti), Vt E R'^. 

7 /<«/ ^ [ie/ J 

Here, 77 < aj means jj < aj, Mj^I. 

Let ti be chosen such that tj / 0 if aj / 0. In this case, for any multi-index < ai, 


Hhy [ [D^^gJ^] (hiti) [D'^^-^^qiY (ti) 
. 16 / 


n (^1^1. 
. 16 / 


7j 


[D^^gY (hiti) [D'^^-'^^qiY (ti) \ I m t 


-«9 

1 


. 16 / 


Vl 6 / 


Here, we assume that $0^0$ is equal to one.
Since $q$ satisfies Assumption (N2), we obtain, similarly to (39),


[D^^m] {ti)\<C{\Ilqi)A 





( 40 ) 


Ci\I\,Qi) ■= , max 


E 

7/<“/ 


aj 

II 


sup 

n/GRl^l 





< 00.(41) 


Put SAti) := Ki{ti)gi{ti), t E Since K E §(M), Sj E §(Ml^h is the Fourier transform of 
a function Si E §(mI^I) C Lp(Rl^l). As 

L(hj){xi) := \|j| [ e"*<*^’^^>m(t7)5/(M/)dt/, 

(27r)0l JrUI 

it follows from Corollary 5.2.5. in Grafakos [17], (39) and (40) 

||L(,^)||^ < 2Aq,|C(|/|, g,) max (p, (p - I)-!)®'"'||5,||^ n 

j^i 

where $C_{|I|} < \infty$ is a dimensional constant which is not given explicitly in the aforementioned
result. Thus, assertion (i) of Proposition 1 is proved with

Cpj := 2A|Cp|C'(|/|,g/)max(p, (p- 1)"^®'^' \\Si\\p^ . 


2) Proof of assertion (ii). Note first that


\L 




(27r) '2 Khjqi ||^ < (27r) 1^1 ^ . 


In view of Assumption (N2) on the errors. 


Khi/qi 


< 


a 2 / Kiihiti)friii+t])^^dti 

Jm I I/, 


Khr 


jei / 16/ 

/d < aI[ \Kj{ui)\ll{l + u])^^/^dui]vdllf^ 

16/ / le/ 


Thus, assertion (ii) of Proposition 1 is proved with 


Cl := a{(2 


10 . 
TT) 2 


Kjgi 


V 


Kigi 




where $g_I$ is given in (38).

6.2. Proof of Proposition 2: case $p < \infty$


Let / € be arbitrary fixed. We get the statement of Proposition 2 by applying Theorem 1 
and Corollaries 2 and 3 in Goldenshluger and Lepski [14] with s = p, X = P = u = t is the 
Lebesgue measure on rI^I, — •) and Ms(w) = ||^ < oo. Here, the 

i.i.d. random vectors are the Y^j's and their common density is fi^qi- By using the continuity 
property of it is easily proved that Assumption (Al) in the aforementioned paper is 

fulfilled. 

1) Case $p \in (1,2)$. Let $r > 1$ and $h_I \in \mathcal{H}_{p,I}$ be arbitrary fixed.

By application of Corollary 2 in Goldenshluger and Lepski [14], one has 


{lIC/i/llp > Up{hi)+z^ < ’ Vz > 0, Vn > 1, 


(42) 


where C/p(/i/) = 4np ||^ and A2 (/ij) = 37n ^ ||L(;j^)||^. 

By integration of (42) we easily get, for all integer n > 3, 


E 


UhX-Upihi)-Ap{hj)^/^^] ^ < r(r + l)[Ap(h,)]^e-^'"W 

-- II 

n 2 L 


< r(r + 1)7^ sup 

hj&'Hpj 


(Ml 


n 


where $\Gamma(\cdot)$ is the well-known Gamma function.
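
The passage from a tail bound to a moment bound ("by integration") rests on the layer-cake formula. As an illustration only, take a generic nonnegative random variable $X$, a threshold $U$ and a scale $A > 0$; these are placeholders, not the paper's exact quantities, and the sub-Gaussian tail below is an assumed form. Bernstein-type tails are handled the same way, with an extra linear term in the exponent.

```latex
\begin{align*}
\mathbb{E}\,[X-U]_+^{r}
  &= \int_0^\infty r z^{r-1}\,\mathbb{P}\{X-U>z\}\,dz
     && \text{(layer-cake formula, } r\ge 1\text{)}\\
  &\le \int_0^\infty r z^{r-1} e^{-z^2/A^2}\,dz
     && \text{(assumed tail } \mathbb{P}\{X-U>z\}\le e^{-z^2/A^2}\text{)}\\
  &= r A^{r}\int_0^\infty u^{r-1}e^{-u^2}\,du
     && (z = A u)\\
  &= \tfrac{r}{2}\,\Gamma\!\big(\tfrac r2\big)A^{r}
   = \Gamma\!\big(\tfrac r2+1\big)A^{r}
   \le \Gamma(r+1)\,A^{r}.
\end{align*}
```

The last inequality uses that $\Gamma$ is increasing on $[3/2,+\infty)$ and that $r+1 \ge r/2+1 \ge 3/2$ when $r \ge 1$, which explains why a factor $\Gamma(r+1)$ appears in bounds of this type.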
Note that, for all integer n > 3, 


Up{hi) + Ap(/i/)v^rln(n) < < 4 + 


(37e“^pr 
2-p 


n 


M|^(G)IL =: 7p,/(r)7/p(/i/). 


Since card(7Cp^/) < f 1 V -j^ j log 2 (n) 


Id 


, we obtain, for all integer n > 3, 


E sup 

hj&Hpj 


UhiL-p,iir)Up{hi) \ <Cp{r)n 2 , 


Cp(r) := 7[r(r + l)]-- sup sup sup |n ^[2 log 2 (n)]||L(;,^)|| | 

nGN*/exs/i/SPp,/ 


which is finite in view of Proposition 1 and the definition of the set $\mathcal{H}_{p,I}$.

2) Case $p = 2$. Let $r > 1$ and $h_I \in \mathcal{H}_{p,I}$ be arbitrary fixed. Here, we apply Theorem 1 in
Goldenshluger and Lepski [14] but we compute differently the upper bound on the "dual"
variance $\sigma^2$ by using the arguments given in the proof of Proposition 7 in Comte and Lacour
[8]. Indeed, we obtain


< n ^ (27r) 


K, 


hi 


Ql 


[ 

Jr\ 




dtp < n ^ (27r) ||^||i 


K, 


hi 


qi 


2 


00 


since ||//||^ < ||//||i = 1. 
Taking into account the latter inequality, the result of Theorem 1 in Goldenshluger and Lepski
[14] becomes


jll^h^llp > Up{hi) + z^ < expj- 


Al{hi) + Bp{hi)z 


, yz > 0, \/n > 1, 


(43) 


Up{hj) = n 2 ||-L (;,^)||2 , 


6 


" ifi Il9/lli^ ^ 


(27r)l^l 

By integration of (43) we get, for all integer n > 3 
E 


Khj/qi +24n 2 ||L(;,^)|| 2 , Bp{hi) =-n \\L(hj) 


ll^/*/llp “ Up{hi) - Ap{hi)-sjrhi{n) - Bp{hi)Y\ii{n) 
< r(r + 1) {Ap{hi) + Bp{hi)Y 


< r(r + 1) ( 6 V ll^ll^ ) sup 

hj&'Hpj 


I Khj/qi 

Note that, in view of Assumption (N2) on the errors, 


+ \L 


{h 




Khj/qi 


< A 


Kigi 


) j. -a. 

i&i 


-A' 

3 


(44) 


where $g_I$ is given in (38). Thus, in view of Proposition 1, (44) and the definition of $\mathcal{H}_{p,I}$, for all
integers $n > 3$,


Up{hi) + Ap(/i/)vrhi(^ + Bp{hi)Y\n{n) 

+ A Kigi 


^ I 8r /48r 




(2vr) 


Id 


lib 

16/ 


= : 7p,/(r)d/p(h/). 

Finally, we obtain for all integer n > 3 


UhiWp-lpjiYUpihj) \ <Cp(r)n2, Cp(r) ;= [r(r + 1)] r 


J + 


6V||g/||i"^| Khj/qi ^ H^} [2log2(n)] n ^ 


E sup 

hid'Hpj 


X sup sup sup 
n£N* leX^hi&npj . 

which is finite in view of Proposition 1, (44) and the definition of the set $\mathcal{H}_{p,I}$.

3) Case $p > 2$. Let $r > 1$ and $h_I \in \mathcal{H}_{p,I}$ be arbitrary fixed.

By application of Corollary 3 in Goldenshluger and Lepski [14], one has 

.v2 


|||?hj||p > Up{hi) + zj < exp |- 


Aj{hi) + Bp{hi)z 


yz > 0, Vn > 1, (45) 







































Alihi) = 16c{p)\\qi\\l,\n ^\\L^hj)\\ji^+n ^ \\L^hj)\\ 2 \\hhj)\i +^ \\^ihi)\L\, 


2 3 

_2 _ 

P+2 

15p 


Bp{hi) = -c{p)n = 

Here, we have used the following inequalities, which are consequences of Young’s inequality. 

||//*97|loo < II//II 1 hlWoo < lk/lloo> 

i_i 

2 p 

00 


fl*Ql <\\fl*Ql\\^ ^ \\fl*Ql\\i<\\Ql\\ 

p 

By integration of (45) we get, for all integer n > 3, 

E UhiWp - Up{hi) - Ap{hi)s/rln{n) - Hp(/i/)rln(n) 

< r(r + 1) {Ap{hi) + Bp{hj)r 

< r(r+ 1) |6c(p) (l\/\\qi\\ 


3 

1^ 


X sup U\L(^hi)\\^ + 

hi£Hp,i I P+2 




In view of Proposition 1, we get 


l|r+,)IL £ l|r+,)lll:' l|r«.,)llJ £ O'*. 


-Y-5 


16/ 


Thus, in view of Proposition 1, (46) and the definition of $\mathcal{H}_{p,I}$, for all integers $n > 3$,


Up{hj) + Ap{hI)^yrln{n) + Bp{hi)rln{n) 


< cj “(1 Vc ,) <16 c ( p )||«||^-+ 


,p 2 


i_i 

1 2 p 


xn 2 


Ylh- ^ " + VHn) \\L(hj)\\^ 

16/ 


P+2 


:= 'fpj{r)Up{hi). 

Finally, we obtain for all integer n > 3 


(46) 


E sup 


UhiL-Jp,i{r)Up{hi) } <Cp(r)n 2 , Cp(r) := 6c(p) [r(r + 1)] ■ 


X sup sup sup 
nSN* /ex^ hi&'Hpj 


1 V Ik/||4)){ \\L{hi)\\^ + \J\\B{hi)\\2 ll^(^/)llp 

+ ll^(/*/)llp}™”'[(lV^)log2(n 

which is finite in view of Proposition 1, (46) and the definition of the set $\mathcal{H}_{p,I}$.

6.3. Proof of Proposition 2: case $p = +\infty$


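Let $n > 3$ and $I \in \mathcal{P}\circ\mathcal{P}'$, $(\mathcal{P},\mathcal{P}') \in \mathfrak{P}\times\mathfrak{P}$, be arbitrary fixed. Assume that $n \prod_{j\in I} h_j^{2\lambda_j+1} \ge \ln(n)$.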

We divide this proof into several steps. 

1) Preliminaries: First, since $q$ satisfies Assumption (N2) and the $Y_{k,I}$'s are i.i.d. random
vectors with density $f_I \star q_I$, we get from Proposition 1


sup sup \L^hi){xi -yi)\ < \\L(hi)\\^ ^ 

x/GRl-fl j/jGKl-f| 


<Ci(K,q)l[h 


-A,-l 


(47) 


Ci{K,q) := 


(2vr)V 


Kigi 


V 


Kigi 


V 1 max D]KjqT 
1 \jel ^ 


V 


Kiipi 


V 


Kupi 


where := sup^gj \tj\ giitj) and gi is given in (38); 


Xj 


grI^I 


sup^^ (lE|L(;,^)(x,-yi,,)|')" < JWfi^qiWo^WhhiA ^ \lhi\\ooCii^^<l)\{hr "• (48) 


-A,-i 


3^1 


Next, set xj and xj be arbitrary fixed in Rbl. For any tj E 


3^1 


g _ I I g 


i6/ 


< |/|sup|e-*b^^'| 

ie/ 

< |/| sup |tj| sup |xj — iCjl . 

je/ j£i 


Therefore, for any yj E M^l 

\L{hi){xi - yi) - L(h^){xi - yi)\ 


< 


(27r)l'f| jw\ 


KhAti) 


Qliti) 


^—i<ti,xi> _ ^—i<tj,xi> 


dtj 


<n\I\Ci{^,q)\{h 


3&I 


sup|xj —Xjl; 
3&I 


(49) 


IE |T(;j^)(x/ - Yiy) - L^hi)ixi - hi,/)!' 


2 \ 2 


< 


ll /j*g/|loo f 

(27r)bl Jm\i\ 


Khjiti) 




\^-i<ti,xi> _ g-*<L,a:/> |2 


<n\I\^l\/\\qi\\^Ci{K,q)llh 

3^1 

Consider now the normalized empirical process 
l.{xi) := C,(K,(7) 


—A —- 

-■ ^ ^sup|xj—Xjl; 
i6/ 


2(1 V lk;|I J I 
(50)







































In view of Bernstein's inequality, (47), (48), (49) and (50), $\forall z > 0$,


where A{xi) := 1, B{xi) := ^ < 1 and 


a{xi,xj) = h{xi, xj) := 2 A < n\I\ sup \xj — Xj\ 

j&i 


It is easily seen that $a(\cdot,\cdot)$ is a semi-metric on $\mathbb{R}^{|I|}$.


(51) 

(52) 


(53) 


2) Supremum-norm over totally bounded sets: In this step we obtain bounds of the 
supremum-norm of the normalized empirical process (•) over totally bounded sets by applying 
Proposition 1 in Lepski [22] with T = RI^I, © = R, y = and \I'(-) = | • |. Then we have to 
check Assumptions 1, 2 and 3 required in the latter Proposition and to match the notations 
used in the present paper and in Lepski [22], 

Note first that, in view of (51), (52) and (53), Assumption 1 is fulfilled with $c = 2$. Next,
consider the family of closed balls


®«(i/) ■= \xi ^ R'^'l : supjxj —tj\ < R/2 > , i? > 1, t/ G 

' I ^6/ J 

In view of the continuity property of the Fourier transforms and the definition of the semi-metrics
$a$ and $b$, it is obvious that Assumption 2 is also satisfied with $\Theta = \mathbb{B}_R(t_I)$, $A_\Theta = 1$ and

Be = (nn,„hf‘^Y. 

Let s : R —)• R+\{0} defined by s{z) := (0, 01 + Obviously X]fc>o < 1 and, for 

any z > 0, 

Rn\I\ 


^e,4zim)-\s{6)) < |/| 


In 


, V(5 > 0, 


(54) 


2;(48(5) ^s{5)J 

where (<5), > 0, denotes the entropy of 0 measured in a. Then, for any z > 0, there exists 

(5* > 0 small enough such that 

e^f'\z,Q) := sup(5“^(£e,a ('2(4:8(5 )“^s(( 5)) = sup (2(48(5 )“^s((5)) < oo; 


(5>0 


5>5, 


e^’^^(^,0) := supd ^(£e^b (-2^(485) ^s((5)) = sup J (-2(48(1) ^s((5)) < oo. 

( 5>0 ’ < 5 >< 5 * 

Thus, Assumption 3 in Lepski [22] is fulfilled and Proposition 1 in the latter paper can be 
applied. Let us compute the quantities which appear in this result. 

Choose s = {s,s), x = {2Aq,2Bq) and e = \/2 — 1. Since A© V Bq < 1 and a{xj,xi) = 
h{xi,xi) < 2, Vx 7 ,T/ G RI^I, we straightforwardly get 

e,-(x,0) := ei")(2Ae,0) + e('’)(2;Be,0) 

< sup 5“^(£e,a (2(48(5)“^s(5))sup (2(48(5)“^s((I)) 

< 5 > 0,61 5 > 0,61 


< 4,5|I|[ln(i?n|/|)]++8,5; 
Ug\y,>c,Q) ■■= Xiv^2[l + e i]2e^(x, 0) + y + X 2 (2[1 + e ^feg{K,Q) + y) 

< , 2(31|f|ln(fin|/|)+59 + .,) 


2A,+1 


Thus, it follows from Proposition 1 in Lepski [22] that, for any y > 1 and any r > 1, 


sup \(hi{xi)\ -Ui^\y, >(,&)} <4r(r + l) 2y 


Xl& R itl) 
T 


,-lryP)/ 
s 


e 2 . 


( 55 ) 


3) Supremum-norm over the whole space: Let x/ G be arbitrary fixed and yi G 
be such that sup^gj \xj — yj\ > n. By integration by parts, we easily get 


\L(hi)ixi-yi)\ < 


maxjg/ 


D]{K,, 


^ Ci{K,q) ^ Ci{K,q) 


(27r)lh sup^g ^\xj -yj\ „ Yljei n Ui^i ’ 


(56) 


Be/ j 


in view of Assumption (N2) on the errors. 

Consider the collection of closed balls $\{\mathbb{B}_n(nj),\ j \in \mathbb{Z}^{|I|}\}$. Obviously this collection is a
countable cover of $\mathbb{R}^{|I|}$. Put, for any $j \in \mathbb{Z}^{|I|}$,


fj := / fi-kqi{xi)dxi, 

Mi) 


u 


(rek). 


kGZl-^l: Bn (nj)nBn (nk) 7^0 


It is easily checked that 




fi-kqi{xi) 


jezl^l 


dxj < 


(57) 


Set $j \in \mathbb{Z}^{|I|}$ such that $f_j \ge n^{-\nu}$, where $\nu > 1$ is specified later. If $y = 2\ln(1/f_j) + (r+1)\ln(n)$,
we get from (55)


E<( sup |Ch,(a;/)l- tS/W 

xjGBn (nj) 


ln(n) 


<2“'+2r(r + l) 7 ^yr) fjn ‘ 2 % 


(v) 


r+1 




where := 4C'/(K,g)y2(l V |k/||oo)(93|/| ln(|I|) + 60 + 2'(; + r), since 

ln(n). 

Thus, in view of (57), we obtain 


> 


sup \Chj{xi)\-jY^r) 

xieBi 


ln(n) 




<2>+2+2|'lr(r + l)[7W (r) 


r+1 

n-—, (58) 


where ©i := Ujg 2 |/|,f.>„-.B|(nj). 
Set $j \in \mathbb{Z}^{|I|}$ such that $f_j < n^{-\nu}$ and $x_I \in \mathbb{B}_n(nj)$. In view of (42) and (56) we get, for any
$k = 1,\dots,n$,

^\L{hi){xi-Ykj)\ = E{|L(/i^)(x/- Yfc_/)| lB(j)(I4,/)} 

+^{\L{hj){xi - Ykj)\ lKlU\B(j)(^fc,/)} 


< P{yfc,/G]B(j)} 


C/(K,g) , CKK,g) 


+ 




< 


2C/(K,g) 

, 2A,+1 ’ 


since fj := P {Y^j G IB(j)} < n~'", u > 1 and sup^gj \xj — Y^jl > n when Y^j G 
Introduce random events 

^j:={ElB(j)(>)c,7)>2l, jGZl^l, D:= U Dj. 

Let $\bar D$ be the complement of $D$. If $\bar D$ holds then, in view of (42) and (56),

2Ci{K,q) 


^ ^J2\^(hi){xi-Ykj)\ 


< 


k=l 


nUjeihj 


2A,+1 


, Vx/ G 02 := 


Since Hie/from (59) and (60) 


Ife/ j 


sup \^hj{xi)\lD^^t]ii^) 
a:/G02 


ln(n) 




2A,-I-1 


and, taking into account that sup3,^g02 \^hi{xi)\ < 2C/(K,q)n, 

< [2CiiK,q)Y n^FiD). 


E 


sup \(hj{xi)\ - 

XI&02 


ln(n) 




^2A,. + 1 


(59) 


(60) 


(61) 


Let $j \in \mathbb{Z}^{|I|}$ satisfying $f_j < n^{-\nu}$ be arbitrary fixed. In view of Markov's inequality one has, for
any $z > 0$,

\[ \mathbb{P}(D_j) \le e^{-2z} \big[1 + (e^{z}-1) f_j\big]^{n} \le \exp\{-2z + n(e^{z}-1) f_j\}, \]

since the $Y_{k,I}$'s are i.i.d. random vectors. Minimizing the right-hand side in $z > 0$ we obtain

\[ \mathbb{P}(D_j) \le (e/2)^2 (n f_j)^2 \le 2 f_j n^{2-\nu}. \qquad (62) \]
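
The Chernoff-type argument behind (62) can be illustrated numerically. The sketch below compares the exact probability that at least two of $n$ i.i.d. points fall in a set of mass $f$ with the optimised exponential bound; the function names and the values of $n$ and $\nu$ are assumptions chosen for illustration, not the paper's choices.

```python
import math

def exact_tail(n, f):
    """P(Bin(n, f) >= 2): at least two of n i.i.d. points fall in a set of mass f."""
    return 1.0 - (1.0 - f) ** n - n * f * (1.0 - f) ** (n - 1)

def chernoff_bound(n, f, z):
    """Markov/Chernoff bound exp(-2z + n (e^z - 1) f), valid for every z > 0."""
    return math.exp(-2.0 * z + n * (math.exp(z) - 1.0) * f)

def optimised_bound(n, f):
    """Bound evaluated at the minimiser e^z = 2 / (n f); needs n f < 2 so that z > 0."""
    z_star = math.log(2.0 / (n * f))
    return chernoff_bound(n, f, z_star)

n, nu = 200, 1.5          # illustrative values only (nu > 1)
f = 0.5 * n ** (-nu)      # a mass below the threshold n^{-nu}
chain_holds = (
    exact_tail(n, f)
    <= optimised_bound(n, f)
    <= (math.e / 2) ** 2 * (n * f) ** 2
    <= 2 * f * n ** (2 - nu)
)
print(chain_holds)
```

At the minimiser the bound equals $(e/2)^2 (nf)^2 e^{-nf}$, and dropping the factor $e^{-nf} \le 1$ gives the middle term of (62); the last step uses $f < n^{-\nu}$ and $(e/2)^2 < 2$.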

Thus, choosing $\nu = 1.5r + 2.5$, it follows from (57), (58), (61) and (62)


E -7oo,/(r) 

where 7oo,/(r) := 7£E^’^^(r). 


ln(n 


\| "rije/'i, 


)_l, <2-+s+Vlr(r+l)h„,Krr"-‘^', (63) 
Finally, in view of the definition of $\mathcal{H}_{\infty,I}$,


E sup [IlChJIoo - 7oo,/(r)Z^oo(/i/)]l > < Coo(r)n 2 , 

Zi/GHoo,/ I 


(64) 


Coo(r) [r(r + 1)] >■ sup sup S 7oo,7(r) 


nGN*/GXJ 


ir+3+2|7| 


[log 2 (n)] r n 2 r ^ < 00 . 


6.4. Proof of Lemma 1


Assume that $\mathfrak{P} \neq \{\overline{\emptyset}\}$. Set $f \in \mathbb{F}_p[\mathfrak{P}]$ and let $r \in \{r_1,r_2,r_4\}$ be arbitrary fixed. We obtain
Lemma 1 by applying Proposition 2. We divide this proof into two steps.


1) Note that 


Cp<Yl UhiWp-lpjirppihi) 


since $\gamma_{p,I}(r)$ increases with $r$. In view of Proposition 2, if $p \in (1,+\infty]$ and $n > 3$,
(% < Cp,i(r)n“i, Cp,i(r) := d|<ppCp(r). 

2) For any p > 1 

Uhj\\p-%l{p{hi) ^+%Up{hi)+ E/j/ft^l 

l + ||K||f(|p + 7pWp + ||K||ffp), 


Gp < 1 + ||K||^ sup sup 

/gxj hieHpj 


< 


7 := sup 7 p 7 (r 4 C) 2 ), ^ := sup sup UhiL - 1p,l{r4d^)Up{hi) 

hidHp,! L 




r n 

t)2-i 


- 

Gp + ||K||^fp 

< 0^ ||K||p 

i + iiKii?(; 

»p + 7p^p + K p fp + fp^ 


fp < f'l|K||? 


Below we use the following trivial equality: for any random variable Y 


E. 




'E/iyp® 

In view of Proposition 2, if p G (1, + 00 ] and n > 3, (Ej |fp|‘ j < Cp^ 2 (r, fp) with 


(65) 


Cp,2(r,fp) := t)2 ||K||f [1 + ||K||^ (d\^\^Cp{rd^) +%Up + ||K||f fp + fp 
Thus, we finish the proof of Lemma 1. 